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Preface to the first edition 



Space and time are the two most fundamental concepts in our world 
because all else is unimaginable without assuming that space (or time) 
exists. It is therefore not surprising that the sophisticated Euclidean 
model of space already existed more than 2000 years. For centuries it 
was a common belief by scientists and philosophers alike that the Eu- 
clidean structure of space was one of the very few eternal truths, It 
was only at the beginning of the 20th century that this belief was shat- 
tered with the introduction of Albert Einstein’s theories of special and 
general relativity. Today, Einstein’s theory of general relativity is com- 
pletely established, and there are many textbooks which explain it at 
all levels of mathematical sophistication. What is missing, however, is 
a modern textbook on general relativity for mathematicians and math- 
ematical physicists with emphasis on the physical justification of the 
mathematical framework. This book aims to fill this gap. 

Knowledge of physics is not assumed. While physical and heuristic 
arguments are given, they are not used as substitutes for any proofs. The 
book is also suitable as an introduction to pseudo-Riemannian geometry 
with emphasis on the intuition for geometrical concepts. 



The physical theme of the book 

Modern textbooks on general relativity typically start with a more or less 
formal introduction to pseudo-Riemannian geometry. In such textbooks 
some knowledge of special relativity is usually assumed, and the reader 
is expected to accept the geometrical framework presented on trust. This 
approach is very economical but obscures the extent to which classical 
general relativity succeeds in describing our universe, and also where it 
may fail. This is a point that is of particular relevance to those attempt- 
ing to quantise gravity. Prom a physical point of view it is important to 
realise which parts of the theory reflect genuine physical insights, and 
which are dispensible. One way this can be achieved is through a criti- 
cal introduction that stresses foundational matters. There are no modern 
textbooks taking this approach, and I hope to fill this gap with my book. 




VIII Preface to the first edition 



One of the most exciting aspects of general relativity is the predic- 
tion of black holes and the Big Bang. Such predictions gained weight 
through the singularity theorems pioneered by Penrose. In various text- 
books on general relativity singularity theorems are presented and then 
used to argue that black holes exist and that the universe started with 
a big bang. To date what has been lacking is a critical analysis of what 
these theorems really predict.^ We give a proof of a typical singular- 
ity theorem and use this theorem to illustrate problems arising through 
the possibilities of “causality violations’’ and very weak “shell crossing 
singularities”. These problems add weight to the point of view that the 
singularity theorems alone are not sufficient to predict the existence of 
physical singularities. 



The mathematical theme of the book 

In order to gain both a solid understanding of and good intuition for any 
mathematical theory, one should try to realise it as a model of a famil- 
iar non-mat hematical concept. Physical theories have had an especially 
important impact on the development of mathematics, and conversely 
various modern physical theories require rather sophisticated mathemat- 
ics for their formulation, Today, both physics and mathematics are so 
complex that it is often very difficult to master the theories in both sub- 
jects. However, in the case of pseudo- Riemannian differential geometry 
or general relativity the relationship between physics and mathematics 
is especially close, and it is therefore possible to profit from an interdis- 
ciplinary approach. 

Euclidean geometry had its origins as the description of shapes in 
physical space. It is generally considered a mathematical discipline rather 
than a physical theory, because it is possible to derive it from a small set 
of physical postulates, which can alternatively be viewed as mathematical 
axioms. Since the concept of space is basic to our everyday experience, 
Euclidean geometry combines mathematical rigor with intuitiveness — 
a combination which has proved to be extremely fruitful for both math- 
ematics and physics. Riemannian geometry is abstracted from the study 
of surfaces in Euclidean space and inherits much of the intuitiveness of 
Euclidean geometry. Hence Riemannian geometry is very well developed, 
and a growing number of geometers have branched out to Lorentzian or 
even pseudo-Riemannian geometry. In my experience, these fields (and 

^ Since I had written this passage a review article (Senovilla 1998) which 
has a very similar theme has been pointed out to me. This article pro- 
vides many very illuminating examples of spacetimes as well as discussions 
which reinforce our sceptical approach towards the physical interpretation 
of singularity theorems. 
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IX 



even Riemannian geometry) appear quite abstract to the majority of 
students. 

A careful analysis of space, time, and free fall — the most funda- 
mental (classical) physical concepts — leads almost automatically to 
Lorentzian geometry. With respect to Lorentzian geometry, we are there- 
fore in a similar situation as ancient geometers were with respect to 
Euclidean geometry. What’s more, virtually no physical background is 
required for this approach. Since Riemannian geometry comes to play in 
the study of submanifolds representing an instant in time, it is completely 
straightforward to extrapolate pseudo-Riemannian geometry from the 
special and physically motivated cases of Lorentzian and Riemannian 
geometry. 

While some modern textbooks present pseudo-Riemannian geometry 
(and general relativity) to mathematicians (an example of this is that 
by O’Neill (1983)), they have not motivated the geometry from basic 
properties of space and time. Instead they have developed it as an ab- 
stract mathematical theory. To ensure that the mathematical description 
mirrors the physical concepts, all definitions have a justification in this 
book. This approach also leads to a careful treatment of the structural 
aspects of the mathematics. 



How to read this book 

This book is not designed so that it is necessary for the reader to start 
at page 1 and then to read on until she or he arrives at page 424. People 
who take this approach will very likely give up before they reach page 
14! The material is ordered in such a way as to allow the text to be 
used as a reference source. It is an unfortunate fact that many parts of 
the theory that logically belong to the preliminaries are not of imme- 
diate interest to a reader who is interested in space and time, and so 
the reader is urged to follow the guides in the margins, which provide 
a shortcut. As an example, the text in the margin denotes the begin- p. iii i 
ning of a passage belonging to the shortcut: p. Ill | denotes the page 
number where the last shortcut passage ended and [| p* 222] the page — 

number where the present passage will end. Additional explanations in 
the footnotes are indicated by ~-^2, where 2 refers to the number of the 
corresponding footnote. The end of shortcut passages is marked simi- 
larly. Having understood the material leading to Einstein’s equation it 
is then not difficult to return to the parts that have been skipped on 
an earlier reading. In addition, hints are given at the beginning of most 
sections as to what is important and should be read . 



^ Explanations referring to the guide in the margin. 
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This book, with its 424 pages is meant to cover both general relativity 
and pseudo-Riemannian differential geometry. It is therefore clear that 
some important topics had to be omitted. 

For mathematicians, the most important omissions are certainly some 
topics peculiar to Riemannian geometry, such as the Hopf-Rinow the- 
orem (O’Neill 1983, Theorem 5.21) and the Myers theorem (O’Neill 
1983, Theorem 10.24). Because these results are contrary to intuition 
one should obtain for Lorentzian (or general pseudo-Riemannian) geom- 
etry and since they are not needed for the description of space and time, 
they have been omitted from this book. 

Physicists may find that the presentation of this book is only loosely 
linked to other physical theories. This loose linkage is possible since the 
theory of space and time is fundamental to any other physical theory. 
The book is therefore accessible to mathematicians and physicists alike. 
Physicists who are interested in applications to astrophysics may wish to 
consult the book by Weinberg (1972). Weinberg’s approach is opposite 
to the one used in this book, and personally I believe that it should 
ideally be read after the reader has a solid knowledge of the conceptional 
aspects of relativity as presented in this book. Most other books on 
general relativity also present the “Kerr solution” , which is supposed to 
model the exterior of a rotating black hole. It has been omitted since 
it is not essential to understanding general relativity. Moreover, it is 
well described in other books. People interested in this solution should 
probably first read Chap. 12 of the book by Wald (1984). The purely 
mathematical aspects of this solutions are clearly presented in O’Neill’s 
book (1995). 
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This book is dedicated to two Australian relativity students who on their 
way to gaining their doctorates courageously stood up against the 
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1. Local theory of space and time 



This book is not meant to be read in the order the 
material is presented. Please follow the guide in the 
margins or skip material as proposed in the italic text 
at the beginning of most sections. 



In this chapter we will develop those aspects of space and time which 
can be locally observed, say in a laboratory. We will start with Euclid’s 
description of space and then incorporate time in to the picture. The 
path we take is rather historical. It starts with intuitive but surprisingly 
complicated concepts (Newton’s theory of absolute space and absolute 
time) and ends with the not so intuitive but mathematically simpler 
theory of special relativity. The guiding principle of this book will not 
be mathematical simplification, but the solution of problems occurring 
in earlier theories. 

The mathematical description in this chapter seems to be global and 
leads to extrapolations which are not validated by any experiments and 
which are not generally true. In the following chapters we will take up this 
point again, and show that the description given in this chapter should 
be considered infinitesimally rather than globally. This is the theme of 
the book. 



p. 1 I 

-^1 

[X P- 3] 



1.1 Space 

In this section we consider space and introduce Euclidean geometry. This 
material is assumed to be familiar to the reader and is therefore presented 
in a rather concise way. 



^ Readers who wish to learn the essentials of the theory of space and time 
quickly and do not mind skipping some mathematical proofs can use the 
guide in the margins. 



M. Kriele: LNPm 59, pp. 1 - 1, 1999 
© Springer- Verlag Berlin Heidelberg 1999 






2 



1. Local theory of space and time 



1.1.1 Affine space 

In this section we introduce affine space as our most elementary de- 
scription of space. Affine space is just R" where the special properties 
o/ 0 € R” are ignored.^ 

It is a basic experience that we can uniquely describe any point in space 
by three real numbers. This seems to be the idea of Descartes (1637) 
who developed analytic geometry as an example of his Discours de la 
Methode. While it is therefore plausible to identify with (physical) 
space, contains a distinguished point 0 whereas space apparently does 
not. Hence by using as a description of space we introduce a math- 
ematical structure which has no physical counterpart. This would lead 
to constructions which cannot be realised in space. For instance, there is 
the unique negative of a vector u e R^ but there is no way to assign the 
negative to a point in space. As another example, addition of vectors has 
no direct interpretation in terms of points in space. If we want to have a 
reliable description of space with the property that all phenomena exhib- 
ited in this description are mirrored by physically verifiable phenomena, 
we have to abstract from these additional structures. 

We will now isolate those structures of R^ which have an intuitive 
meaning in terms of space. Given two points x, y we can construct an 
arrow v which points from x to y. This arrow induces a map from space to 
space. We just move the arrow (without rotating) such that its untipped 
end coincides with a given point 2 . The point 2 : is then mapped to the 
tip of the arrow. It appears that — as long as we don’t rotate v — this 
definition is independent of the path which we use to move v from x to 
z. In R®, this parallel transport is just given by the map : R^ R^, 
z z + y - X. Observe that y-x stands here for the arrow v which is 
not a point. Thus the geometric interpretation is different from a simple 
addition of vectors. In order to separate the concepts involved we define 
the concept of real affine space. 

We do not yet know what “rotating” should mean in mathematical 
terms — so far we have simply a physical picture in mind. However, the 
following definition (for the n-dimensional case) reflects some properties 
of our naive notion of parallel transport and does not introduce any 
additional properties. The set of points is denoted by A”. 

Definition 1.1.1. An n-dimensional real affine space is a set A” and 
a collection {R„ : A” ^ A" : v € R"} of bijective maps such that the 
following holds, 

(i) Rv+w = RyoRuj Vv, w € R", 

^ Readers who already have some knowledge of manifold theory (cf. Chap. 2) 
and connections (cf. Sect. 2.6) can take affine space to be the usual R” 
considered as a manifold together with the flat connection. 
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(ii) for all x,y e there exists a unique such that 

Rv,.,y{x) = y. 

We denote Rv{x) by x -{‘V or v x and by y—x or xy. 

We will see below that this definition is so restrictive that it does not 
allow for more affine spaces than there are vector spaces. 

Remark LLL Of course, space is 3-dimensionaL We work with general 
dimension n for several reasons. Firstly, most of the theory we will be 
developing does not depend on the dimension — with the exception of 
a key-result whose proof, however, is too complicated to be presented in 
this book (cf. Theorem 5.3.1). Secondly, it is often convenient to compare 
the theory with lower dimensional analogues which are easier to visualise. 

It is therefore advisable to formulate it in a way which encompasses these 
analogues and also shows the restriction of the analogy. Thirdly, there 
exist modifications of Einstein’s general theory of relativity to higher 
dimensions (“Kaluza-Klein theories”). 

Another way to generalise the theory would be to allow for complex 
vector spaces as well. We refrain from doing so since the main result of 
Section 1.1.2 only holds for affine spaces over M. 

Note that for x € A^, the associative law 

X {u-\rv) = {x -\-u) -^v 

holds. It is easy to see that all n-dimensional real affine spaces are 
isomorphic, and can be realised by in the following way. Choose 
any o € and define (/)o : A’^ — ^ R’^, x Now iden- 

tify with 4>q{A^) and define for x G ^o(A^), v ^ R^ the bisection 
ku{x) = (j)o{Rv{4>o~^{x)))- Clearly, Observe that 

these definitions are independent of the arbitrarily chosen point o. Con- 
versely, choosing an o € A’^ we can recover the structure of R^ by iden- 
tifying o with the zero vector 0. | j p. 14 

1*1.2 The fundamental theorem in affine geometry 
and doubly ruled surfaces 

In this section we present some results of affine geometry which will 
be needed in the proof of Theorem 1.4>T This section is very technical 
and should be omitted on first reading. 

Let a, xi, . . . , x/c € and . . . , € R such that = 1- Then 

the bary centre with masses a^, . . . , a^, 

^ Section 1.1.2 is needed for the proof of Theorem 1.4.1 which is central to our 
interpretation of the Michelson-Morley experiment. However, the reader is 
strongly advised against reading this part now. 
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k 

a^xi+a‘^X2+ . . . +a^Xk + a%Xi—o), 

i=l 

is independent of o and therefore an affine invariant. The symbol + is 
defined via the right hand side and can only be applied to “linear com- 
binations” where the real factors add to 1 . An affine subspace B of 
is a set of points {x = a^xi+a'^X 2 + . . . +a^Xk : = 1 }, where 

xi,..,,Xk eive pairwise different, fixed points. The affine dimension of B 
is A: - 1 . It follows that an affine subspace is an affine space. An affine 
subspace of dimension 1 is called an affine line. We call points lying on 
a single line collinear. Observe that lines are the smallest sets which are 
invariant under parallel transport. 

Lemma 1 . 1 * 1 . Let x,y,z ^ Then x, y, z lie on an affine line if and 
only if there exists a A G M such that x = y + X{z—y), 

Proof X lies on the line generated by y, ^ if and only if there exists an ^ G 
Mwithx = /3y+(l-/?)^-y+/5(y-y) + (l-^)( 2 :-y) =y + (l~^)(;^-y). 



Definition 1 . 1 , 2 . An affine map is a map f:A^ f(x) ~ 

A{x—o) 4- b, where A is a linear map^ o G A’^, and b G MP. If A is 
bijective then f is called an affine transformation. 

A collineation is a bijection f:A^ which maps any three 

collinear points into collinear points. 

Consider a line I and three points xi,X 2 ,xs on L Then the number A 
given by X 3 — xi = A(x 2 ^xi) is denoted by 

^3-xi 
X2 — Xi ’ 

The following lemma is the classical theorem of Thales. It will be used in 
the proof of the fundamental theorem in affine geometry (Theorem 1.1,1 
below) . 

Lemma 1.1.2. Let C be parallel hypersurfaces and I be 

a line which intersects these hypersurfaces. Let Xi{l) — Hif]l. Then 

^s{l)-xi{l) 

^2{l) — Xi{l)^ 

does not depend on L 

Proof. Denote by H the subspace of which is the associated vector 
space to the affine space Hi (and since Hi,H 2 ,Hs are parallel also to 
H 2 y Hs)- We consider the quotient space A^/H defined by 
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X y if and only if y — x ^ H. 

This space has a natural affine structure with associated vector space 
W^/H given by 7r(n:)— 7 t(^) = 7f{x'—z) where tt, tt denote the projections 
to the equivalence classes. We have 



7r(x3(i))-7r(a;i(0) = Tf{x3{l)-xi{l)) 
f x3{l)-xi{l) 

\X2{l)-Xi{l) 
_ X3{l)-Xl{l) 



which implies that 



{X2{l)-Xl{l)) 



X2{l)—Xi{l) 

a:3(0-xi(0 

X2{l)-Xl(l) 



ii{x2il)-xi{l)) 

(tt{x2{1))—it{xi{1))) 



X3{l)-Xi{l) ^ Tr{x3{l))-Tl{xi{l)) 

X2{l)-Xi{l) Tr{x2{l))~n{xi{l)) 

only depends on the projected values. Now it is sufficient to observe 
that Tr{xi{l)) is independent of I since all points in Hi are equivalent: 
x,y € Hi=^ 7 t{x) = 7r(y). I 



It is easy to see that all bijective, affine maps are collineations. Con- 
versely, the fundamental theorem in affine geometry asserts that any 
collineation must be affine: 

Theorem 1.1.1. Let be an affine space over M with n > 2 and fix 
o € A. Let A^ be a bijection which takes each three collinear 

points into collinear points. Then there exists a point b ^ and an 
invertible linear map f such that f{x) = /(rr— o) + b for all x € 

The proof is elementary but lengthy and requires some preparatory lem- 
mas. We will follow (Berger 1987, p. 52-55) where one can also find a 
version of this theorem which holds in the complex case. Observe that 
the following proof makes heavy use of the assumption n > 2. The the- 
orem does not hold for n = 1 since in this case any map maps collinear 
points into collinear points. 

Lemma 1.1.3. Let o^xi^ . . , ^Xk G A^, f be a collineation, A^, . . . , 6 

M, and 

k 

X = o + ^ A^ {Xi—o) e A’^, 
i=l 

Then there exist /i^, . . . , 6 M such that 

k 

fix) = /(o) + ifixi)-f{o )) . 

i—1 
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Proof. For A: = 1 the claim is clear by the definition of a collineation. 
Assume now, the assertion is true for all m € {1, . . . , /c - 1}. For 

m+l m 

X ~ 0 + ^ A'^ {xi—o) let x' = o-{- ^ ( 0 :^— 0 ) . 

i=l 



Then we have 

X^~^^{Xm^i—o) ( 1 , 1 . 1 ) 

and by induction hypothesis there are real numbers , . . . , with 
/(x') - f{o) = ~ /(^))- We define also 

y =: o + A"^+^(rr^+i-o), (1.1.2) 

^ 

The triples {z,x^ ,y}^ {y, o,Xm+i], and {z,o, a:} consist each of collinear 
points. This is clear for the first triple and follows from Lemma 1.1.1 
for the second triple. To see this for the third triple observe that y—o = 
x—x\ z = |y+^x' is the centre of the parallelogram defined by 0 , y, x, x' 
and therefore the intersection of the line connecting y with x' and the 
line connecting o with x. Since each of these three triples consists of 
collinear points there exist such that 

/(z) = af{x')+{l - a)f{y), 
f{x) = pf{o)+(l - P)f{z), 
f{y) = /(o)+7(/(a;m+i)-/(o))- 



This implies 

f{x)=Pf{o)+{l-P)f{z) 

= P{f{ 0 )-f{ 0 )) + (1 - P){f{z)-f{o)) + f{ 0 ) 

= (1 - P){{af{x')+{1 - a)f{y))-f{o)) + /(o) 

= (1 - P){a{f{x')-f{o)) + (1 - a){f{y))) + f{o) 

m 

= (1 - p){a ^ m'* {f{xi)-f{o)) 

i~l 

+ (1 - a)'</{f{Xm,+l) - f{o))) + /(o) 

m+l 

= X! {f{x:i)-f{o)) + f(o). 
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Lemma 1.1.4. Let such that {xi— o, . . . ,Xn—o} is a 

basis ofMP. If f is a collineation then {/(a:i)— /(o), . . . , f{Xn)—f{o)} is 
also a basis ofW^, 

Proof Let x G he any point and let x = f~^{x). Since {xi—o, ...,Xn~ 
o} is a basis of there exist ^ R such that x—o=^ <?)• 

Lemma 1.1.3 implies that there exist /i\ . . , , € R such that 

k 

X - f{o) = fix) - f{o) = ifixi)-f{o)) . 

Since x was arbitrary the assertion follows. I 



Lemma 1.1.5. A bisection f is a collineation if and only if it maps 
affine lines onto affine lines. 

Proof. Let x,y G A^ and denote by I the line spanned by these points. 
Let i be a point on the line spanned by /(x), f{y). We have to show 
that = f~^{z) G L If this was not true than the vectors z—x^y—x 
would be linearly independent. But then Lemma 1.1.4 would imply that 
f{z)—f{x), f{y)—f{x) where linearly independent as well. Contradic- 
tion to the construction of z = f{z) I 



Lemma 1.1.6. Let f be a collineation. Then f maps parallel lines into 
parallel lines. 

Proof. Let I, I he two parallel lines (which do not coincide — otherwise 
there would be nothing to prove). Since they are parallel they span a 
plane P rather than a 3-dimensional subspace of A^. 

This plane is mapped into a plane P\ In order to see this consider 
a line I such that the lines I, I intersect and span P. It is clear that any 
line which intersects_both I and I is contained in P. Moreover, any point 
y G P lies on a line I which intersects both I and 1. Let P' be the plane 
generated by the (intersecting) lines f(l) and f(l). f(y) lies on the line 
f{T) which intersects f{l) and f{l). Hence f(J) (and therefore f{y)) lies 
in P'. 

Having established that /(P) is a subset of a plane we only have to 
show that f(l)nf{l) = 0. If there was a point ^ e f{l)f)f{l) then f~^{z) 
would lie in both I and L which is impossible since both lines are parallel. 

I 



Lemma 1.1.7. Letk: R — ^ R an automorphism, i.e., k{a(3) ~ k{a)k{fi) 
and fc(aH-/?) = k{a) + k{P) for all real numbers a, P . Ifkj^O then A: = id 




8 



L Local theory of space and time 



Proof. fc(0) = /c(0 + 0) = fc(0) + fc(0) implies fc(0) = 0. Assume, there is an 
a^O with k{a) — 0. Then k{P) — k{a)k{P/a) - 0 for all P and k must 
vanish. Hence k{a) ^ 0 Va ^ 0. ^(1) = k{l • 1 ) = k{l)k{l) implies 
fc(l) = 1 . By induction we obtain fc(n) = n for all natural numbers. 
fc(~n) = fc (0 - n) = fc(0) - k{n) = ~-k{n). Similarly, we have k(l/n) = 
l/k{n) = 1 /n. For n, m € Z we have now k{n/m) = njm and the lemma 
is proved for all rational numbers, a < p implies k{a) < k(p) since for 
any positive number 7^ we have ^(7^) = ^(7)^(7) > 0 . Let now 7 be any 
number. Then there exists a monotonically increasing sequence 7 
of rational numbers and likewise a monotonically decreasing sequence of 
rational numbers pi 7. Hence = k{ai) < k{'y) < k{Pi) = Pi which 
implies fc(7) = 7. I 

Observe that this lemma would be false if we had replaced M by C as 
z would be a counter example. This is why theorem 1,1.1 (as stated 
above) is not true for afBne spaces over the field C. 

Proof of Theorem 1 . 1 . 1 . Let /: R^, v f{v) = f(o + v)—f{o). 

The idea of proof is_^to construct an automorphism fc : R R such that 
f{Xv-\-fiw) = k{X)f{v) -j-k{^)f{w) holds for all A,/i e R and v^w e R^. 
We will use constructions based on parallel lines in order to represent 
vectors such as + tc, (A + jj.)v^ Xfiv. Since / maps parallel lines into 
parallel lines (Lemma 1 . 1 . 6 ) these constructions will be preserved by / 
and can therefore be used in order to prove linearity and multiplicativity 

We will first show that / is additive. 



/ 




Let e R^ and consider the lines spanned by 0^0 -^v and 
0,0 + w. The point o-bv-^w is the intersection of the parallel translation 
of Iw that contains o-\-v and of ly that contains o-bw (cf. Figure 1 . 1 . 1 ). 
Since parallel lines are mapped into parallel lines we know that f{o + 
V -bw) is constructed analogously from /(o), f{o -f u), f{o + u;). Hence 
f{v + w) = f{o-bv-\-w)-f{o) === f(o+v-{-w)-f{o-\-v)^flo+v)~f{o) = 
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f {o ^ w)— f {o) + f (o + v)— f (o) — f{w) -i- f{v). Here we have used the 
fact that the vectors connecting f{o) with f{o + w) and f{o + v) with 
f{o^v + w) are identical since they correspond to opposite sides of a 
parallelogram in a plane. 

Now we show that there is a well defined automorphism k: R R 
such that f{Xv) = k{X)f{v) for all u G and A € E. We first fix a vector 
V and consider the line I through o spanned by v. Denote hy gi: I R 
the map o + A and by the map f{o) + fif{v) fx. Since / 

maps the line through o which is spanned by v into the line through f{o) 
which is spanned by f{o + v)—f{o) the map A:: E — > M is well defined 
through the relationship f{Xv) = k{X)f{v). From 

f{o) + k{X)f{v) = f{o) + /(Av) = f{o + \v) = f{gi^{X)) 
we see that k is given by k{X) = gf(j^ o / o 



/ 




Fig. 1.1.2. Additivity of k 



In order to prove additivity of k we use the fact that {X+ix)v — Xv-h/xv 
can be constructed using parallel lines (cf. Figure 1.1.2) Let w he 
linearly independent from v and consider the triangle defined by the 
points o, o + o Xv. This triangle can be parallely translated so 
that the point o is mapped into o + jxv, (We simply parallely translate 
the lines generated by its sides as indicated in the figure). Since this 
translation preserves the vectors defined by the sides of the triangle we 
have obtained a geometric construction of the point o + A^; 4- Xw, Since 
this construction only employs intersection points and parallel lines it is 
preserved by the map/. Hence we obtain /((A + /i)t;) = f{Xv)-\-f{}xv) = 
k{X)f{v) + k{g)f{v) and therefore 

k{X + /i) = gfg^) o / o g'[^{X + /i) - o /(<? + (A + ix)v) 

^ 9f{i){f{o) ^ f{{X^ ix)v)) 

= 9m{f{o) 4 k{X)f{v) 4 k{fx)f{v)) 
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= 9mifio) + (fc(A) + k{n))f{v)) = k{\) + k{jj,). 



f 




The proof of multiplicativity is similar and employs a slightly dif- 
ferent geometrical construction (cf. Figure 1.1.3) which is justified by 
Lemma 1.1.2. The configuration in the first part of Figure 1.1.3 lies in 
a plane whence hyper surfaces are simply lines. Denote by H 2 the line 
which connects o + v with o T tu, by i?i its parallel translation through 
o, and by its parallel translation through o + Au. Further denote the 
line through o and o -\-v hy I and the line which connects o with o w 
by V . Using the notation of Lemma 1.1.2 we have 

\ (o + Au)— o _ 

(o + v)—o X 2 {l)—xi{l ) ' 

Hence Lemma 1.1,2 implies that the intersection of and I' is really 
o + Xw as depicted in the figure. We apply this lemma a second time 
where the three parallel hypersurfaces are now given by the 

line connecting 0 + /xu with 0 + uj, its parallel translation through o, and 
its parallel translation through o -\- Xw. It follows that the intersection 
of i ?3 with I is o + yu(Au) = o + A/iu. Since this construction only em- 
ploys intersections and parallel lines it is preserved by / and we obtain 
f(Xfj>v) ~ k{X)k{fj)f{v). This implies 

k{Xn) = gf(i) o /(o + \g.v) = 9 f(i){f{o) + f{Xnv)) 

= 9f(.l){f{o) + k{X)k{fi)f{v)) = k{X)k{iJ,). 

Hence k is really an automorphism of the real line. One can geo- 
metrically show that this automorphism neither depends on v nor on 
o. However in our case this automorphism is trivially well defined since 
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we already know that the only non-zero automorphism of R is the iden- 
tity. This also implies f{\v) = Xf(v) for all A e R, v € R”. Hence the 
theorem is proved. | 

We will now turn our attention to special subsets of affine space which 
become important in the proof of Theorem 1.4.1. 

Definition 1.1.3. Let U C be an open set and x: U ^ A'^ be a 
map such that at each point (s,t) e U the differential is 

injective. Then x is called an immersed surface. If x is also injective 
then it is simply called a surface. 

A surface should be envisaged by its image, a two-dimensional, smooth 
subset. An immersed surface may have self-intersections. 

Since lines have such a fundamental meaning in affine geometry, sur- 
faces which are generated by lines are of special interest. 

Definition 1,1.4. A ruled surface is a surface which can be parametri- 
zed by a function of the form x{s, t) = c{s) -h tw{s). Such a parameteri- 
sation is called a ruling of the surface. 




Fig. 1.1.4. A ruled surface 



Example 1.1.1. The simplest ruled surfaces are those which admit two 
different rulings. 

(i) A trivial example would be any plane. 

(ii) A slightly more sophisticated example is given by the rotational 
hyperboloid. Let c(s) = (cos(5), sin(s), 0)"^ be the unit circle in 

and consider arhyp(5,t) = c(s) ^t{c{s) + (0,0,1)"^). Clearly, x 
is a ruled surface and explicitly given by x\^y^{s,t) = (cos( 5 ) - 
t sin(s), sin(s) cos(s), t)"*~ . Since it satisfies the equation + 

(^hyp)^ ~ (^hyp)^ == 1 it must be a rotational hyperboloid. The same 
surface is described by Xhyp(^, t) = c{s)+t{-c{s) + (1,0,0)^) which 
is a different ruling. 
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(iii) A third example is given by the hyperbolic paraboloid. Let 
c(s) = (SiOjO)"^ and . Then Xpar(s,i) = 

c(s) + tw(s) satisfies = ^par ^par parameterises a 

hyperbolic paraboloid. We can interchange and Xpa, to ob- 
tain a difl?erent ruling of the same surface, Xp^r (s,t) — (0, s, 0)^ -I- 

Theorem 1.1. 2* Let M C A.^ be a surface which admits two different 
rulings. Then — up to an affine transformation — M is a subset of 
either a plane, a rotational hyperboloid, or a hyperbolic paraboloid. 

Proof, It is easy to see that any surfaces M C with two rulings can 
locally be embedded into A^. One just has to consider a line h of the 
first ruling which intersects a line I 2 of the second ruling. Choose another 
line ^3 of the second ruling which also intersects l\. Then all three lines 
span a 3-dimensional affine subspace. At least locally, any further line of 
the first ruling must intersect both I 2 and ig whence it is contained in 
the same affine subspace. Since M is generated by the lines of the first 
ruling we have proved the assertion. 




Fig. 1.1,5. Proof of Theo- 
rem 1.1.2 — first case 



If there are any two generators of the first ruling which lie in a plane 
then the ruled surface must be this plane. Hence we can assume that any 
two generators are linearly independent. There are now two possibilities. 
Either there exist three generators which are all parallel to a single plane 
or any three generators are linearly independent. 

In the first case let h, I 2 , h be different generators of the first ruling 
which are all parallel to a single plane. We can now find linear coordinates 
such that the x^-Axis coincides with li and the x^-axis is 
parallel to h- By choosing the origin appropriately, li is given by x^ = 
= 0 , 12 by x^ =0,x^ = aS and k is given by x^ - x^ = 0, 

where e M. Let P be a two-plane which contains h. Then there 

exists an s € R such that P is given by x^ -h sx^ =0. Any generator I' 
of the second ruling which is contained in P must intersect both I 2 and 
Jg. We obtain for the intersection points: 
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Fig. 1.1.6. Proof of Theo- 
rem 1.1.2 — second case 



{P n I2} : =0, — -a^s, = a^; 

{P n ^3} : x^ = a^a^s, ~ -a^s, = a^. 



It follows that r has the parameter form 
0 



a^a^s 






-0!^S 



4" t — s (ck 



I t G . 






Since the ruled surface is generated by such these lines V ^ we have ob- 
tained a parameterisation (s,t) x{s^t) of it. Eliminating the param- 
eters s,t we obtain x^x^ ~ whence the surface must be a 

hyperbolic paraboloid. 

For the second case we choose linear coordinates {x^yx'^^x^} such 
that the x^-Axis coincides with Is, the rr 2 ’-axis is parallel to and the x^- 
axis is parallel to ^i. We can chose the origin 0 of the coordinate system 
such that it lies in Is and such that I 2 lies in the plane = 0. Then 
there exist numbers G R such that Is = {x : x^ = = 0}, 

I 2 = {x : x^ = a^,x^ — 0}, and h — {x : x'^ — a^,x^ = a^}. Let P be a 
plane which contains Is and is not parallel to = 0. Then there exists 
an s G R such that P is given by x^ - sx^ — 0. Any line V of the second 
family which lies in P must intersect li and ^ 2 - We calculate 



{P n h } : x^ = $a^, x'^ — o? , x^ — 

{Pril2}-x^~a^, x^ — — , = 0 . 

5 



This gives the line 



\ j a*- — sa^ 
j / $ 1+^1 / s “ 



: t G 



0 



—Of'" 
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It follows that (s, t) are parameters of the ruled surface. If we eliminate 
(s, t) we obtain the equation 

-a^x^x^ + a^x^x^ - a^x'^x^ + a^a^x^ - 0 . 

This is a quadric. We could use the Gram-Schmidt-procedure to show 
that this quadric is affinely equivalent to a hyperboloid. But since any 
quadric in is affinely equivalent either to the sphere + {x^Y + 
(a:^) = 1, the two-dimensional pseudo-hyperbohc space, - (x^)^ -I- 

{x^Y+{x^Y = 1, the rotational hyperboloid, - (a;i)% + = 

~1, the cone = 0, the hyperbolic paraboloid - + 

(x^) - = 0, or a plane, we can infer without any further calculation 

that our surface must be affinely equivalent to a rotational hyperboloid. 



P. 3 i 
U P. 15] 



1.1,3 Euclidean geometry 

Euclidean geometry gives the local model of space. In the following 
sections we will obtain models of space & time which incorporate Eu- 
clidean geometry as description of space. Unless otherwise stated, here 
and in the following space has dimension n — 1. We assume that Eu- 
clidean geometry is known to the reader and therefore only summarise 
a few facts. 



In affine space, we have no definition for “length” or “angle” . Since these 
are fundamental concepts for our perception of space, we must endow 
affine space with an additional structure. The first scientific and ex- 
perimentally well tested description and axiomatisation of space involv- 
ing these notions culminated in the “Elements of Euclid" (ca. 340b.C.- 
270b.C.). In modern terminology, Euclid’s theory of space can be iden- 
tified with Euclidean geometry. 

The central object of Euclidean geometry is the scalar product. 
Definition 1.1.5. A scalar product on a real vector space Y is a map 



VxV-^R (1-1.4) 

(u, u) !-»■ (u, u) (1.1.5) 

such that for any u,v,w €Y, A, /r € R the properties 

(i) {u,\v + pw) = \{u,v) -\- ii{u,w), 

(ii) (u,v) ^ {v,u), 

(Hi) {u, u) > 0, 

(iv) (u,u) = 0 u = 0 



hold. 
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We can now define an Euclidean space as an affine space equipped with 
a scalar product. 

Definition 1.1.6. Euclidean space is apair {hP'~^ 'where 

(i) is the {n — l)-dimen$ional, real affine space with associated 
vector space 

(ii) is a scalar product onW^~^. 

A map (f>: is an isometry if and only if 



= {yi-xi,y2-x2)^n~i 



for allyi,xi,y 2 ,X 2 G ^ 



The physical notions we wish to capture with our mathematical def- 
initions are ^‘distance” and “angle”. The distance between two points 
x^y € A^~^ should only depend on the connecting vector u = y— rc. 
It is plausible to demand that the distance of x and x + Xu is X times 
the distance between x and x ^ u. Hen ce, given a scala r product, the 
definition dist(o:,y) := y^{x—y^x—y)^n~i seems to be a 

reasonable choice. 

It is clear, however, that in order to measure the angle between two 
vectors one needs a map x ^ M which is symmetric in both 
entries and remains unchanged if one of the vector is multiplied by a 
real number. The angle between two directions ti, v may therefore be 

defined by Z(u, v) = arccos J ■ 

It is not a priori clear that a scalar product is indeed the appropriate 
additional structure for defining lengths and angles. See (Weyl 1923, §19) 
for a theoretical justification of the usage of scalar products. 



Proposition 1.1.1. A map 'll): Euclidean struc- 

ture (A^~^, (•, invariant if and only if there exist a linear map 
A: R^”^ and points o ,6 G A’^"^ such that 'ip{x) = A{x—o) - 1--6 

and {Au^ Au)^r,.-i = {u,u)^r,.-i for all u E 



Proof Observe first that an application of {Au^ Au)^,,-i = {u, u)^r,.-i for 
all u E R^“^ to the vector u = v + w implies {Av^ Aw)^n.^i = {v^ 
for all v^w G. Hence the map x h-s- 'ijj{x) = A{x—o) -h b satisfies 



[p- 14 1] 
^4 

i P> 16 



{'lP{yi) - '(p{xi),ij{y2) - ij{x2))Rn-i 

= (A(yi-o) - A{xi-o),A{y 2 -o) - A{x 2 -o))^,,^^ 
= {A{yi ~~xi),A{y 2 ~X 2 ))^r,.-i 
— {yi ^1) y 2 ^2)]^n-i • 



In the proof of Proposition 1.1.1 we appeal to Theorem 1.1.1. 
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Conversely, any map ip which preserves the Euclidean structure preserves 
in particular the affine structure. Hence Theorem 1 . 1.1 implies that there 
is a linear map A and a point b such that ip{x) — A{x—o) + h for all 
a; 6 A. Since ip{y) - ip{x) = A{y - x) it is clear that A must satisfy 
(Au, Au)^„-i = for all u € | 



p. 15 I 
U P- 33] 



Remark 1.1.2. At first sight our definition of a Euclidean space may 
seem to be too general. The reader may feel that in space there is a 
subset of physically distinguished scalar products: 

Let e be a vector which we use as measuring stick defining unit length 
and E a plane which contains ei. Using a pair of compasses we can 
construct a line le C E which is orthogonal to ei and therefore also 
a vector C 2 of the same length as ei but perpendicular to ci. We may 
now construct a second plane Eej by rotating C 2 around ei and a third 
plane Es 2 by rotating ej around C 2 . The intersection Eg^ n Eg^ is a 
line orthogonal to ei and C 2 . Using again or pair of compasses we can 
construct a third vector es which is of unit length and orthogonal to ci 
and 62 . Our distinguished scalar product is now given by (cj, 6 j)|[j 3 = Sij. 

It follows firom the Theorem of Pythagoras that the length of a vector 
u is given by ||u||ir 3 . We can use a pair of compasses to approximately 
(but arbitrarily well) divide the circle into a fixed number of arcs thereby 
introducing an approximate measure of angle. Prom the definition of the 
cosine it is clear that (up to a constant factor depending on the number 
of arcs) the size of an angle is given by the definition above. 

However, this introduction of the standard scalar product is based 
on procedures which are intuitive but which cannot be defined in mathe- 
matical terms without having a scalar product in the first place. In fact, 
if we had started with any given scalar product (•, •) and had defined 



(i) a rotation as a linear map which leaves the scalar product invari- 
ant and 

(ii) a pair of compasses as a device which for each given plane E 
containing a given vector e produces all vectors e' £ E with the 
same length as e. 



then using our construction we would just have recovered (•, •). The fol- 
lowing proposition gives a mathematical explanation of this fact. 



Proposition 1 . 1 . 2 . Let (A” \(-,-)k,.-i) and (A"■-^ (•, •)g„_i) be two 
Euclidean spaces. Then there exists an affine map 'ip: ^ 

which satisfies 



(V’(yi) - '<p{xi),ip{y2) - Ip{x2))^„-1 = ~{yi - xuy2 - 2 ; 2 )k»-i 
for all xi , yi , X 2 , j /2 € A"“U 
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Proof. Choose any points o, 6 6 ^ and let {ei, . . . , e^-i} (respec- 

tively {ei, . . . , en~i}) be an orthonormal basis with respect to (••)jnr,.-i) 
(respectively ( • We define the linear map A by Aei — Then 

the affine map -^(a:) = A(a:— o) + 6 is the desired isomorphism. I 

The map 'if is often referred to as an Euclidean transformation. 

Corollary 1.1*1* Let a scalar product ofW^~'^. Then there 

is a basis {ei, . . . , Cn-i} ofW^~‘^ such that (e^, = 6ij, 'where 

^ fori^j, 

1 0 otherwise 

is the Kronecker symbol. 

Today, Euclidean geometry is often taught as a prime example for a 
closed and consistent mathematical theory. This obscures the fact that 
angles and distances are physically measurable and that therefore Eu- 
clidean geometry can be falsified as a physical theory. (For instance, 
one of the most influential philosophers since the time of enlightenment, 
Kant (1781), wrongly considered space as given “a priori”). 

In modern times, Carl Friedrich Gaufi (1777-1855) seems to have 
been the first to realise the possibility that Euclidean geometry may 
not be the correct description of our world — though the legend that 
he tried to verify Euclidean geometry by measuring the angles between 
three mountain summits is not true (Osserman 1995, page 66). He has 
developed a non-Euclidean geometry in which the parallel axiom does 
not hold but did not publish it. This geometry was also independently 
discovered by the Hungarian mathematician Janos Bolyai (1802-1860). 
Later in this book we will conclude that space should be described by 
geometries which are far more general than those considered by Gaufi 
and Bolyai. 



1.2 Absolute space and absolute time 

In this section 'we present the ^^naive’^ model of space and time. We 
will take care to show how complicated it really is. We will also give 
a short account of Newion^s theory of particles which is the main 
physical justification of this spacetime concept 

Time seems to have striking similarities with space but nevertheless to be 
something which is very different. Like space time is a continuum. How- 
ever, space is a 3-dimensional continuum while time is 1-dimensional. 
Moreover, we can freely move in space but merely drift in time. It is 
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1. Local theory of space and time 




Fig. 1.2.1. A curve in a 
spacetime diagram 



often practical to treat space and time in a unified manner. For instance, 
spacetime diagrams are used to describe movements (cf. Fig. 1.2.1). 
Hermann Minkowski (1864-1909) (Minkowski 1909) has pointed out that 
nobody has ever experienced space without time or time without space. 
This observation is borne out by characterising space & time in the 
following way. 

Definition 1.2.1. A primitive spacetime is set The points of a space- 
time are called events. 

Of course, this definition does not tell anything about the relation of 
space and time or even allows to distinguish between these concepts. 
In order to do so we must supplement the primitive spacetime with a 
geometrical structure. 

In the preceding section we have recalled that space can well be 
described by (n - l)-dimensional Euclidean space. The fact that time 
is 1 -dimensional indicates that spacetime can be considered as an n- 
dimensional affine space which is foliated by (n - l)-dimensional sub- 
spaces each of them carrying a Euclidean structure. Any foliation with 
affine hyperspaces corresponds to a linear map r : R, where x^y E 

are in the same hyperspace if and only r{x—y) — 0. Denote by 
Ex — {y E A^ : r{y^x) — 0} the affine hyperplane through x and let 
o, o' e Then the vector spaces associated with all these affine hyper- 
planes Ex {x E A'^) are identical. In fact, they are given by r”^(0) — 
(u e R’^ : r{v) ~ 0}. Hence we only have to specify one single Euclidean 
scalar product vector space r“^(0) in order to get a 

foliation of (n - l)-dimensional Euclidean spaces. This is in accordance 
with our experience that the geometry of space does not change from 
one instant of time to another. 

The map r can be interpreted as a world clock: The time difference 
between to events x and 0 is just r{z — x). Observe that r is uniquely 
defined up to a factor. This factor corresponds to the physical unit in 
which time is measured. 

We still need to link events in different hypersurfaces which corre- 
spond to the same point in space. The simplest way to do so is to intro- 
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duce as a second structure a vector t and interpret all points lying on the 
line a; + Rt as the same point in space at different times. If we normalise 
t by r(i) = 1 then the time difference between x and y = just t. 

Definition 1.2.2. A Newton spacetime is a quadruple 



(A”, t,T, (■,.),_, (0)), (1.2.6) 

where t € R^ a distinguished vector^ r : R^ — > R a linear map such that 
r(t) = 1, and (•, is a scalar product on the vector space r“^(0). 

This definition is just the content of Isaac Newton's (1642-1727) theory 
of absolute time and absolute space. ^ 




Fig. 1.2.2. Absolute space, abso- 
lute time 



We see that spacetime is fibred twice, By lines parallel to t and by 
hyperspaces of the form where o is some fixed event. This structure 
may appear quite cumbersome but it captures our naive point of view 
in a geometrical way. 

One can think of t as defining a time axis and therefore an absolute 
notion of rest, and think of r as defining an absolute notion of instant 
of time. The pair (r, t) induces a projection R^ — ^,R^, u f— ^ u, where 
V = + V and r(v) — 0. 

A map which leaves the structure of a Newton spacetime invariant 
consists of a spatial Euclidean transformation as given in Proposition 
1.1.2 and a spacetime translation. 

Proposition 1.2.1. A map -0: ^ leaves the Newton spacetime 

(A^, t, r, invariant if and only if there exist a linear map 

A : r“^(0) r~^(0) and points o, 6 G A”" such that 

> 

(i) 'ip{x) = A{x--o) + r{x--o)i -h b and 

(ii) {Au,Au)^^^q^ = for all u G r“^(0). 

^ In the next section we will discuss an improved spacetime model which is 
named after Galileo Galilei who lived before Newton. The reason for this 
is that Galilei emphasised different points than Newton, points which are 
more important to us nowadays. However, what will be referred to as a 
Galilei spacetime also incorporates ideas due to Newton. 
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Proof. It is easy to check that maps of this form are isomorphisms of New- 
ton structures. Conversely, observe that any affine map -0 which maps 
any affine hyperplane Ex into some affine hyperplanejB^:' is necessarily 
of the form 

^(x) = A{x—o) -I- r{x—o)v + b 

where ^ is a linear map of r"^(0) into itself, v e and o,b G 

That A satisfies {Au,Au)^^^q^ = for all u e r~^(0) 

follows from Proposition 1.1.1 and the fact that 0 restricted to Ex is an 
isometry of Euclidean spaces. 

Since the vector t is an invariant of the Newton spacetime the equa- 
tion o+t— o = i implies ' 0 (o+i)— ^ 0 ( 0 ) = t and therefore (r(i)u + b) — 6 = 
t. But this equation is equivalent to u = t. I 

Observe that the choice of o is irrelevant, it can always be absorbed by 

b. 

We call the set of all isomorphisms 0 of the Newton spacetime the 
Newton group JV. 

Given a Newton spacetime we can find a basis {ei, . . . , e^} of with 
respect to which r = (1, 0, . . . , I3), t = (1,0, . . . ,0)^, and = 



1.2.1 Non-relativistic particles 

Here we very briefly indicate elementary aspects of Newton^s theory 
of particle mechanics. We will only touch on those features which are 
necessary for later sections. This section is included for the benefit of 
mathematicians. 

A particle is thought to be a small material object without interior or 
exterior structure. This is of course a gross idealisation of many macro- 
scopic objects, but for some purposes surprisingly good. Billiard balls 
are typical examples. On the other hand, one cannot neglect the internal 
structure of a football. It will be noticeably deformed when hit. This 
contributes to its springiness and at the same time shows that the par- 
ticle model is not adequate. An American football has a shape which 
contributes to its movement when it rolls on a flat surface. Again, a 
particle description would be a bad approximation. 

Newton observed that even if all its structure can be neglected, a 
material object does carry a parameter which characterises its move- 
ment in spacetime. This parameter is its mass. 
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Definition 1.2.3. A non-relativistic particle with mass m is a pair 
(m,7) where m G and 7: t ^ j{t) e satisfies = 1. The 

curve 7 is called its world line in spacetime. ® 

It has been first expressed by Galilei (cf. Sect. 1.3) that under ideal 
conditions a particle which is not subjected to any external force moves 
along a straight line."^ 

Definition 1.2.4, A non-relativistic inertial particle zs a 
particle (m, 7) which satisfies 7 = 0, 

It is clear that a non-relativistic particle is inertial if and only if j(t) = 
X v) for some point x e AP' and some constant vector v with 

r{v) = 0. 

Of special interest to us are collisions of inertial particles. We un- 
derstand under a collision of particles any interaction of them which is 
confined to a compact subset of spacetime. It is best to think of collisions 
in the sense of colliding billiard balls. But we explicitly allow that par- 
ticles break up or stick together. Since the collision is confined in space 
and time it is possible to speak in connection with a collision of incoming 
and outgoing inertial particles. Let (m^, denote the incoming 

inertial particles and the outgoing inertial particles. Then 

the following laws are experimentally well justified.® 

(i) Conservation of mass. Yli=i ~ 

(ii) Conversation of spatial momentum, 

(iii) Conservation of kinetic energy. 




It is easy to see that these laws are invariant with respect to isomor- 
phisms in the Newton group Af. 

It is clear that most particles do not move along straight lines. In 
this case an external force must act on the particle in order to force it 
to take a different path. 



Definition 1.2.5. A (time dependent) force field F is a map F: A'^ 

r-HO). 



® We only need r( 7 (t)) > 0 in order to guarantee that the particle moves into 
its future. The normalisation r('y(t)) ~ 1 synchronises each particle with 
the world clock t. 

It is not absolutely clear whether Galilei really meant straight lines or more 
complicated curves which take into account the shape of the earth. 

® These laws are intimately linked to the homogeneity of space and time. This 
is the content of the Noether Theorem. For further details cf. any textbook 
on (theoretical) mechanics. 
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In a given force field F a particle 7 moves according to the differential 
equation 

= F, (1.2.7) 

In particular, vanishing force implies that 7 is an inertial particle. 

According to the physical interactions under consideration a particle 
may also carry a variety of_^other parameters besides m. As an example 
consider an electrical field E: Every particle (m, 7 ) carries 

another parameter q which_determines the force with which the electrical 
field acts on the particle, F = qE. 



1.3 Galilei’s theory of relativity 

In this section we drop some of the structure of Newton spacetime in 
order to arrive at Galilei’s theory of relativity. We also argue that his 
theory was revolutionary given the paradigms of the time. 

Galilei’s theory of relativity has been motivated by cosmology. We do 
not feel that the earth moves into any preferred direction. It is therefore 
plausible to believe that the earth is at rest and that all objects at the 
sky are moving around it: The sun rises in the East and during the course 
of a day moves to the West, and there are analogous descriptions of the 
movements of the moon and the stars. It was already well known that 
planets are not moving along strictly circular orbits. In the traditional 
cosmology of the Greek astronomer Claudius Ptolemeaus (ca, 100-160) 
this was accounted for by an elaborate construction using epicycles. 

It was a revolutionary act of Nicolaus Copernicus (1473-1543) to as- 
sert that the sun is the centre of the universe and that the earth is moving 
around the sun just like any other planet or star (Copernicus 1543). He 
did so in order to arrive at a model in which movements would be theo- 
retically more uniform and which would therefore be in better accordance 
with the teaching of the ancient Greek philosophers Pythagoras (ca. 570 
b.C.-500 b.C.) and Platon (ca. 428 b.G.-347 b.C.) (cf. (Kanitscheider 
1984)). However, his model was not only technically more complicated 
(using more epicycles than Ptolemeaus) but also encountered a number 
of serious problems. 

(i) If Copernicus was right one should be able to discover a parallaxis 
effect at the sphere of fixed stars. If the fix star sphere and the 
earth rotate both around the sun with different velocities, then one 
should observe different angles a, (3 between two neighbouring stars 
according to the time of the year. (Cf. Fig. 1.3,1). 

(ii) Some passages in the bible seem to contradict the theory of Coper- 
nicus. In particular, it states that Joshua stopped the sun for a few 
hours. This statement would not make sense if the sun would not 
have moved before. 
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(hi) The model of Copernicus is inhomogeneous. While all planets circle 
around the sun the moon definitely moves around the earth. No 
other exception was known. According to Ptolemeaus, every object 
in the sky moves around the earth. Hence the traditional system 
seems to be more homogeneous on a large scale and therefore to be 
advantageous. 

(iv) The laws of mechanics seem to contradict Copernicus’ hypothesis. 
Imagine a stone falling from the top of a tower. Since the tower 
(being fixed to the ground) would move together with the earth, 
one would not expect the freely falling stone to hit the ground at 
the foot of the tower. However, exactly this is everyday experience. 
(Cf. Fig. 1.3.2). 




Fig. 1.3.1. Parallaxis effect 




Problem (i) has been addressed by Copernicus himself. He just assumed 
that the sphere of fixed stars is so large that the parallaxis effect cannot 
be measured. Ironically, the true radius is orders of magnitudes larger 
than the radius he proposed. (He was just concerned with making the 
effect unobservable). The other three objections have been answered by 
Galileo Galilei (1564-1642) some 60 years later. 

Galilei was least successful with Problem (ii). While he could quote 
church authorities (for instance, Aurelius Augustinus (354-430)) to the 
effect that one should not interpret the bible literally when it comes to 
questions of physics, the establishment remained unconvinced. One of 
the reasons has been the fear to set a precedence. If people started to 
doubt any part of the writing they could as well start to be sceptical 
about other parts which are closer to the main doctrine. Hence there 
was a major threat to the whole building of Christian belief. The theory 
of Copernicus was put on the index and Galilei — after having written 
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a brilliant but rather defiant semi-popular book (Galilei 1632)^ on the 
matter — was sentenced to house arrest. He obtained this comparatively 
mild punishment after a public but insincere abdication of his scientific 
assertions. 

Galilei solved Problem (iii) by careful observation (Galilei 1610). The 
telescope had just been invented and Galilei was one of the first to use it 
as a scientific tool. He observed that the planet Jupiter also has moons 
and used this observation to show that the cosmological system of Ptole- 
meaus of the universe was not more homogeneous than the system of 
Copernicus, On the other hand, since it was believed that beyond the 
moon the world was filled with a medium very different from air, many 
philosophers doubted the accuracy of the telescope. They claimed there- 
fore that it was doubtful that the telescope which was acknowledged 
to work well on earth could be trusted when applied to the position of 
planets. Galilei argued that the telescope was accurate with respect to 
all the known phenomena in the sky and that it was therefore justified 
to use it as a scientific tool. 

Galilei solved Problem (iv) by asserting a law of inertia which asserts 
that a constant movement had no influence on physical processes. Galilei 
supplemented this law with the important physical assertion that com- 
plicated velocities can be decomposed into simpler ones. According to 
this law the stone would keep its initial tangential velocity while falling 
down and therefore come to rest at the foot of the tower — regardless 
of the velocity of the earth. It can be argued that this solution of the 
problem was the most revolutionary act in natural sciences and started 
physics as a scientific discipline in the modern sense. Recall that every- 
day experience seems to point against Galilei’s law of inertia: If we set 
a wagon into motion it will certainly come to a stop after some while. 
Moreover, there was a generally accepted physical theory by Aristotle 
(384 b.C.~322 b.C.) which explained this experimental fact. (The wagon 
has an initial impetus which is responsible for the movement and which 
is used up during the motion.) Galilei gave many examples to make his 
law of inertia plausible and to show that it is a law for a limiting case 
without friction. For instance, he claimed that a stone falling from the 
mast top of a smoothly sailing ship would also reach the ground at the 
foot of the mast.^^ 

^ This book is a literary and physical master piece. Even today it is well worth 
reading! 

As compelling this example may appear to us, at the time there were some 
good reasons to doubt it. Since the velocities involved are rather small it 
would be difficult to verify Galilei’s claim experimentally. Also, while the 
ship moves wind is blowing into the same direction. It is conceivable that the 
stone is just blown to the right position. (To value the merit of such counter 
arguments one has to be aware that at this time, good, quantitative physics 
has not yet been available). Some of these arguments have already been 
answered by Galilei, who, for instance, circumvented the wind argument by 
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Galilei realised that his law of inertia is not compatible with the 
notion of absolute rest. Instead he postulated a fundamental principle of 
relativity. 

Postulate 1.3.1 (Galileian relativity). For any two observers which 
move relative to each other with constant velocity all physical processes^^ 
are the same. 

It follows that the vector i in the definition of Newton spacetime which 
defines absolute rest does not have a physical meaning. (It is another sign 
of the originality of Galilei that Newton thought he had to re-introduce 
a concept which had already been shown to be superfluous). 

While we have lost the notion of absolute space we can still retain 
absolute time. Spacetime is then fibred by hyperplanes t = const and we 
obtain the following simpler structure of spacetime. 

Definition 1.3.1. A Galilei spacetime is a triple 

(*? ‘)t“^o)) ? (1.3.8) 

where r:W^ R is a non-zero linear map and (■,*)r-i(o) ^ scalar 

product on the vector space r”^(0). 




Fig. 1.3.3. Relative space, abso- 
lute time 



The linear map r defines a world clock by defining r{y — x) to be the 
time difference between to events x and y — exactly as in the Newtonian 
model presented above. In contrast to Newton’s spacetime we do not have 
the vector field t at our disposal and therefore there is no absolute rest 
space. We have replaced “absolute space” by a distinguished family of 
“inertial systems” or “inertial observers” . The notion of “Rest” can only 
be defined relative to an “inertial observer” : 

Definition 1.3.2. Let r, (•, be a Galilei spacetime. 

(i) A non-relativistic observer is a curve t ^{t) € such that 

r(7(^)) = 1. 



claiming that the physics in a cabin of a smoothly sailing ship would be 
exactly the same as on earth. 

Strictly speaking, he only considered mechanical processes. 
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(ii) A non-relativistic inertial observer is a curve of the form 
j{t) = a; + where i e r(t) = 1. 

(Hi) A non-relativistic observer jj, is at rest with respect to a non- 
relativistic inertial observer 7(f) ~ x -{-ti if fi(t) — i. 

Hence given a non-relativistic inertial observer ^(t) — x ti we obtain 
a splitting of spacetime into space and time relative to 7. Physically, 
this amounts to regarding the observer 7 as being at rest. We can also 
interpret i as a relative time axis. Relative to the non-relativistic inertial 
observer 7 we have thus recovered the structure of a Newton spacetime. 
Notice, however, that this is only possible by arbitrarily distinguishing 
one non-relativistic inertial observer. This motivates the following defi- 
nition. 

Definition Let t € E’^ be a vector with r(t) = 1. The pair (t, r) 

is called a non-relativistic reference frame. 

For any given reference frame (t, r) we obtain a map (•): E’^ t— 
via the unique decomposition v = u 4- where r{v) = 0. 

Proposition 1.3.1. A map 't/j: A'^ — > leaves the Galilei space- 

time (•, invariant if and only if there are a linear map 

A: r~^(0) a vector v e E’^, and points o,b € A'^ such that 

(i) 'ip{x) = A{x—o) + r{x—o)v + 6, 

(ii) r{v) = 1, and 

(Hi) {Au,Au)^^^q^ = for all u e r“^(0). 

Proof It is straightforward to check that maps of this form are isomor- 
phisms of Galilei spacetimes. Conversely, observe that any affine map ijj 
which maps each affine hyperplane Ex into some other affine hyperplane 
Ex' is necessarily of the form 



'ip{x) = A{x—o) 4- r{x—o)v -f b 

where A is a linear map of r“^(0) into itself, u G E^ and o, 6 G A^, 
Since ^ preserves r we have r{i){x)—^l;{o)) = r{x—o) for all x^o e 
A^. Hence we obtain 



T - o) + r{x—o)v + b~bj ~ r{x—o)r{v) = t{x—o) 
which in turn implies r{v) = 1. 

The third property follows since A must preserve the Euclidean scalar 
product I 

The Galilei group Q is the group of maps which leaves the Galilei space- 
time invariant. 
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It should be noted that the Galilei spacetime is compatible with 
Newton’s theory of particles as described at the end of Sect. 1.2. The 
Galilei spacetime was well accepted as the correct model of space and 
time for more than 200 years. However, in the 19th century a theory of 
electro-magnetism emerged which, together with this spacetime model, 
was incompatible with Postulate 1.3.1. Still, scientists continued to think 
that the postulate would hold for mechanical processes. 



1.4 Einstein’s special theory of relativity 

We start with a discussion of the fundamental Michels on- Morley Ex- 
periment which indicates that the velocity of light has an absolute 
value c. These findings indicate that the set of all possible light rays 
form a further invariant of nature. We will see that this leads to the 
structure of a Minkowski spacetime (Theorem L4-Vf equivalently, 
to Einstein’s special theory of relativity. We use the results from the 
two preceding sections to show that there is no need for additional 
structures in spacetime. In Sect 1.4’ 2 we give a short discussion of 
some consequences of special relativity such as the “Twin paradox” 
(which, of course, is not paradoxical at all). 

The proof of the fundamental Theorem 1.4’ 1 requires section 1.1.2 




Fig. 1.4.1. A flash of light at times to = 0, 

tl, t2 



In the 17th century two important properties of light emerged. 

In 1676 Olaf Romer discovered that the velocity of light is finite. He 
did this by noticing that the there was a yearly oscillation in the periods 
of the moons of Jupiter. 

The Dutch physicist Christian Huygens (1629-1695) developed a 
wave theory of light (Huygens 1690). In a very superficial way, we may 
view light as an analogon to water waves. 

The following two paragraphs should not be taken too seriously by the 
reader. We give an overly simplified version of the wave theory of light 
— just enough in order to understand the Michelson-Morley experiment 
presented below. Moreover, today the theory of quantum electro dynamics 
provides a much deeper understanding. 




28 



1. Local theory of space and time 



In water waves each “drop of water” individually moves in a cir- 
cle thereby inducing a similar movement (with a small time delay) 
of neighbouring drops. All these moving drops together form a wave 
(cf. Fig. 1.4.2) Since each drop is influenced by the neighbouring drops 
these time delays accumulate and the whole wave seems to move. If two 
different waves meet then (in a very rough approximation) they sim- 
ply linearly superpose each other. This will result in a characteristic 
(and often complicated) pattern, the “interference pattern” . In partic- 
ular, this superposition will result in a much larger wave if both waves 
are synchronised and in the other extreme they may cancel, 

Diffraction experiments indicate that this crude picture qualitatively 
also applies to light for which, however, matters are mathematically sim- 
pler. Again using a very rough model, one may think of the electrical 
field E at each point as oscillating up and down with respect of a fixed 
direction. The influence of neighbouring points gives rise of to a wave 
as described above. The wave length A is the distance between two con- 
secutive maxima and very small. It specifies the colour of light. If two 
waves are superimposed then the result may be brighter if they are syn- 
chronised. In the other extreme, the waves may even cancel altogether if 
the setup is arranged such that maxima and minima (of the same size) 
are superimposed. In this case the result is darkness, (cf. Fig. 1.4.3). 

In order to explain the wave nature of light one used to believe that 
space is filled with a substance called “ether” which plays the same role 
as the water for the water waves. An important problem would then be 
to determine the movement of the earth with respect to the ether. 




Fig. 1.4.2. A wave consisting of linked os- 
cillations 



Since the velocity of light should not be directionally dependent, in 
the non-relativistic reference frame connected with the ether a flash of 
light should propagate in concentric spheres (cf. Fig. 1.4.1). The cor- 
responding picture in spacetime would be a cone. To be more precise, 
consider the non-relativistic reference frame of the ether, given by the 
pair (t, r). Let o be the event at which the flash of light is emitted and 



13 



For water waves, this linear superposition is in fact a rather bad approxi- 
mation. 
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Fig. 1.4.3. Superposition of waves 




Fig. 1.4.4. Future light 
cone and Galileian rela- 
tivity. The observer mov- 
ing with spatial velocity v 
measures a different centre 
of the flash of light and 
therefore different radii di, 
d 2 for its wave front 



Eq = {x £ : r{x—’o) ~ 0} be the instant of time defined by o. We 

have a foliation {Eo-\-ti}teu of with spatial hyperspaces. Each vector 
V can be uniquely decomposed into spatial and temporal components, 
'0 = where r{v) = 0. A light ray which is sent out B.tx^Eo with 

the spatial velocity c describes the curve o + M(t -he) in A^. Hence with 
respect to a reference frame fixed to the ether a flash of light corresponds 
to the future light cone 

= |y e A" : ||c||^-i(0)(r(j/-o))2 = > o} 

in spacetime. The fact that the field x ^ Ct = ct + X—0 of fu- 
ture light cones are not invariant with respect to Galilei transformations 
(cf. Fig. 1.4,4) would enable one to measure the reference frame of the 
ether, i.e, the movement of the earth with respect to the ether. This was 
the aim of Albert Abraham Michelson (1852-1931) and Edward Williams 
Morley (1838-1923) (Michelson 1881), (Michelson and Morley 1887) in 
their famous interference experiment (cf. Figs. 1.4.5, 1.4.6). A light ray 
is partially reflected at a half silvered mirror id. The part of the light 
ray which is not reflected at H is reflected at a mirror M and then par- 
tially reflected at H before reaching the observer O. The part of the ray 
which is immediately reflected at H is reflected by a mirror M' and then 
(partially) passes through H to arrive at the observer O. The distance 
between M and H is I whereas the distance between H and M' is V. 
Both light rays have the same intensity when they arrive at O. Here they 
produce an interference pattern which allows to measure the difference 
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Fig* 1.4*5. Michelson-Morley ex- 
periment, at rest relative to the 
ether 






M' M'M' 

Fig. 1.4.6. Michelson-Morley ex- 
periment, moving relative to the 
ether 



of the distances which each light ray has travelled (Here one makes use 
of the fact that the wave lengths of visible light are extremely small 
and that the superposition effect allows to measure the distance which 
a light ray has travelled at an accuracy of half a wave length). Since the 
laboratory is at rest with respect to the earth, according to Galileian 
relat ivity the interference pattern should depend on the angle between 
HM and the velocity of the earth. Let c G be the modulus of the 
velocity of the light in the ether , v be the velocity of the earth relative to 
the ether and assume first that HM || v (cf. Fig. 1.4.6). The first part of 
the light ray will travel from H to Mi in time ti and cover the distance 
cti ~ I + ||'i?||r-i(o)^i- If it travels from M to if in time t 2 , it will cover 
the distance ct2 ~ I — \\v\\r~^ { 0 )^ 2 - These equations imply 



+ ^2 — 



2l/c 



(1.4.9) 



The other part of the light ray travels in time f from H to M' and in 
the same time back to H, thereby covering the distance 



2ct' = 2^1'^ + 



This gives 



2t' = 



2V/c 



\/l - 



(1.4.10) 



We are interested in the time difference At ~ 2^^--(^i+^2) for both paths. 
Since the number |lf^||r“i(o)/^ is very small we only need to calculate the 
time difference to second order in ||F||.,— i(o)/c. 



At — ^2 — = 



•: 1 - ll»i;-.(0)/=" 




1.4 Einstein’s special theory of relativity 



31 



2^+Kihi?-x(o)/cV-z' 

2 61 (-1 + i(||^1|^.(o)/^^) + 

~ C 1 - |M|2-i(0)/c2 

« \ (<5^ (-1 + + \mU(0)/c^i) 

X (l + ll^ll?-l(0)/c^) 

I II^I^KO) 251 ( lll^1l?-.(0)^ 

C c2 C y 2 c2 )’ 



where /' = i + 5L This gives a displacement per wave length A of 



ZiZii 



cAt HI^II?-i(o) 25l 1 i|v||?-i(o)^ 



It follows that the interference depends crucially on the length difference 
SI which cannot be measured accurately enough. In order to overcome 
this difficulty Michelson and Morley turned the whole setup by 7r/2 and 
measured the interference difference. For the rotated setup we must set 
At = 2t' ~ti ~t 2 and interchange I, V, An analogous (but in the details 
slightly different) calculation gives 



cAt I 

AZ\ — - ■ 

^ A A 






(? 



+ 



1 + 



11^11 



'(0) 



The relative displacement depends on SI only up to second order and is 
given by 

Ziz = ziz, - ziZx » f (i - f ) . 



If one assumes that the sun rests relative to the ether, uses the Hg 
spectral line with A w 5.461 • 10“^*^m and has I « 21m then one would 
obtain AZ ^ 0.4 which is well in the range which can be observed. 
However, all such experiments had a negative outcome. 

A possible explanation of this negative outcome is that the earth rests 
with respect to the ether. But the earth circles around the sun which it- 
self rotates in our galaxy. Since in the course of the year the earth changes 
its velocity direction relative to these other velocities, it is inconceivable 
that all year round the velocity of the earth relative to the ether can be 
neglected. Another explanation would be that light moves like particles 
and that therefore Galileian relativity would apply to light as well. Since 
the light was from earth bound sources and the observations have been 
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made on earth, this assumption would explain the negative outcome of 
the experiment. The Michelson-Morley experiment has therefore been 
repeated using star light again with negative results. Now one could 
argue that as soon as the starlight is reflected at mirrors fixed to the 
earth, the light reflected light should be viewed as being produced on 
the earth. This would explain even negative results for star light in the 
framework of Galileian relativity (Hasse 1995). While this explanation is 
conceivable, it would demand a new theory of reflection. It is much sim- 
pler to assume that the velocity of light is independent of the movement 
of its source. This is the traditional interpretation which we will adopt 
in this book. It has been given further support by many consequences 
of the resulting theory (for instance, the possibility of obtaining huge 
amounts of energy from nuclear fission and nuclear fusion). 

Since our interpretation of the experiment of Michelson and Morley 
is in contradiction to Galilei^s theory of relativity we have to reconsider 
the foundations of spacetime. In order to do so we start with our new 
insight about the nature of light propagation, i.e. that the set of possible 
future light cones is an invariant structure of spacetime. Here and in the 
following we will chose units such that c = 1. (In the Sl-system, one has 
c = 2.99792458 -lO^m/s.) 

Postulate 1.4.1 (Invariance of the future light cones). 

Spacetime can be identified with together with an invariant 
field of future light cones C} = -f rc— o, x € 

We start the investigation of this postulate, by first determining all maps 
which leave this future light cone structure invariant. To simplify the dis- 
cussion we choose again a non-relativistic reference frame (r, t) and de- 
note by (•) the induced projection of to r“^(0). Defining the bilinear 
form 

R, (u,u) f-> r}{u,v) = -r{u)r{v) + 

we can write 

Co: = {y e : 77(y-x, y-x) = 0 and = {y € • r{y-x) < 0}. 

We call Cx the light cone and C~ the past light cone at a:. It is clear that 
a transformation which leaves the field of the future light cones invariant 
must also leave the field of light cones x ^ Cx invariant. The bilinear 
form rjQ is a Minkowski metric as defined below. 

Definition 1.4.1. A Minkowski metric rj is a constant bilinear form on 
with signature (— , -h, . . . , +).^^ 

This means that there is an “orthonormal basis” as defined directly below. 




1.4 Einstein’s special theory of relativity 33 



A basis {eo, . . . , e^-i} ofW^ is called an orthonormal basis with re- 
spect to T] ifrj{eo,ea) = ~Soa ?](ei,ej) = 5ij for all t, j € {1, , . . ,n - 
1}, ae {0,...,n-l}. 

A Lorentz transformation is a linear map L: ^ such that 

7 ]{uyv) = rj{Lu,Lv) for all u,v e The set of all Lorentz transforma- 
tion is the the Lorentz group and denoted by 0(n, 1). 

It is now easy to find a class of maps which leave the light cone structure 
invariant. Let a € M \ {0}, b eW, o e and (f>: ^ he given 

by <l){x) = aL{x—o) + 6. It is immediate that 4>{Cy) - C^(^y) for all 
y and that therefore the transformation (j) satisfies our invariance 

requirement. The following theorem due to Alexandrov (1950) implies 
that all isomorphisms are of this special form. 



Theorem 1.4.1. Let n > 3 and (/>: ^ be a bijective map such 

that <f> and (f>~^ map lightlike vectors into lightlike vectors, r]{x—y, x-^y) — 
0 ~ 0 analogously for Then there 

exist an L e 0{n, 1); an a € R \ {0}^ an o e and a 6 € R^ such that 
<f{x) = aL{x—o) + b for all x E A^, 

There are several proofs of this result. For instance, Benz (1992) shows 
that the theorem follows from the fundamental theorem of Laguerre ge- 
ometry. We will follow Alexandrov (1975) who gave a particularly elegant 
proof. His proof rests on results in affine geometry which have been given 




in Sect. 1.1.1. 



Proof of Theorem I 4 .I. Let y E Then rj{y-x,y—x) ^ 0 and by 
assumption rj{f){y)-(l){x),(f){y)-(i){x)) = 0. The last equality implies 
(f){y) e C'^(x) and therefore that 4> maps generators of the light cone 
Cx into the light cone C^(^x)' 

Now assume that P is a two-plane which intersects Cx in two genera- 
tors lx I'x' We will show that <j){P) is also a two-plane. Since for any y E lx 
the cone Cy intersects P in generators ly = lx and l^y which is parallel to 
Vx (and similarly for y^ El'x)^ P is ruled by two different families of par- 
allel generators. Since (f maps generators into generators 4>{P) must also 
be a surface with two different rulings and, by Theorem 1,1.2, be affinely 
equivalent to either a plane, a rotational hyperboloid, or a hyperbolic 
paraboloid. Each generator in P of one family intersects all generators 
of the other family. Since in a rotational hyperboloid the generators at 
opposite points of the circle c{$) lie in parallel planes, <^(P) cannot be 
affinely equivalent to a rotational hyperboloid. To see that 0(P) cannot 
be affinely equivalent to a hyperbolic paraboloid note that in a hyper- 
bolic paraboloid all generators of a given family are parallel to a single 
2-plane and that this property is affinely invariant. Consider (f}{y) E 4(x) 



The proof of this theorem requires the material presented in Sect. 1.1.2. 
The theorem but not its proof is essential for the following. 
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and fix any 2-plane Q. Then there are exactly two generators ii, I2 of 
which are parallel to Q* H we now (parallely) translate the cone 
along we see that the generators of the translated cone which are 
parallel to Q must also be parallel to h and I2. This implies that <^(P) is 
generated by a family of parallel lines. Hence the hyperbolic paraboloid 
degenerates to a plane. 

We can now show that ^ is an afhne map. Let I be any line and 
X € I 3.nd consider two different planes Pi, P 2 which contain I and 
intersect (7x \ {:r}. The intersections of these planes with Cx consist of 
two generators each. Then <p{Pi) and <?!>(P 2 ) are also planes and their 
intersection a line. It follows from Theorem 1.1.1 that <j) is affine. 

Since the property rj{y-x,y-x) = 0^ 

0 is translation invariant we can without loss of generality assume that 
(j) is linear. Let t be a vector with t) = -1 and let e be a vector with 



r/(e,e) = l, r?(e,t) = 0. (1.4.11) 



Then 0 == ? 7 (eit, c±t) implies 0 — r;((;i(e)±(^(t),^(e)±(;6(t)) and therefore 
r/((^(e), ^(e)) = — ^(0(t),^(t)) =: a, ??(^(e), (^(t)) = 0. Any vector v can 
be decomposed into v v,td- vii, where e satisfies Equations ( 1 . 4 . 11 ) 
and v^,Vi G M. We obtain y{(p{v),(f>{v)) = a 7 ]{v^v) which implies that 
—(f) leaves the quadratic form associated with y invariant. But then it 
must also leave rj invariant. | 
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Definition 1.4*2* Let rj be a Minkowski metric, 
called Minkowski spacetime. 



The pair {A^,r}) is 



Using orthonormal bases it is easy to see that all Minkowski spacetimes 
are isomorphic, i.e., we can speak of ‘‘the” Minkowski spacetime. 

In a Minkowski spacetime there is no designation of future and past. 
(Observe that we needed the 1-form r in order to define the future direc- 
tion.) Observe that the set C‘o\{o} consists of two connected components, 
say \ { 0 } and C~ \ {o}. We may now choose (respectively, C~) as 
the set of events in Cx to the future (respectively, past) of o (including 
o) . Hence Cq is the set of all events which can be reached by a light ray 
with source in o. By continuity, this also determines the future direction 
at any other event x e A'^ where (7+ =: {x-o) + C+. Observe that this 
definition coincides with our previous definition in the case rj ~ tjq. Let 
u e be a vector with rj{v, v) ~< 0. Then we have 

(i) either rj{v^w) < 0 for all w € and r){v^w) > 0 for all w € C~ 

(ii) or rj{v, w) > 0 for all w G and rj{v,w) < 0 for all w G C~ . 

Hence we can alternatively define the future direction by singling out 
a vector v G R’^ with rj{v,v) < 0. Since it is more practical to work 
with vectors than with connected components of light cones one usually 
chooses this alternative definition. 
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Definition 1.4*3. Let a Minkowski spacetime, A time orienta- 

tion is an equivalence class [v] of vectors in such that rj(v,w) < 0 for 
all VyW e [t;]. 

Given a time orientation [z;]; we say that the future light cone at 
o € A is the set = {o-^w ^ Co : rj(w,^v) < 0} and the past light cone 
is the set C~ — {oAw € Co' rj{w, z;) > 0} . 

Let V be the invariance group of the light cone structure and V be 
the group of Poincare transformations, V = {x L{x--o) + 6 : 6 G 
o e A^, L € 0(m, 1)}. Given an orientation [z;], we call nV 

the group of time orientation preserving Poincare transformations. The 
discussion above suggests to reduce the group ^ to a subgroup V~^ by 
asserting that the elements of V‘^ map future light cones into future light 
cones. 

Lemma 1.4.1. Let {A^,rj) a Minkowski spacetime and [v\ a time ori- 
entation, Then V'^ €V \ ^ subgroup off. 

Proof Clear. I 



Lemma 1.4.2. The transformation <p is an element off if and only if 
there is an a gR \ {0} and a ^ eV such that <j) = a\jj; (j) G f^ if and 
only if there is an a e \ {0} and a z/? G such that (f) = a^ip. 

Proof This follows directly form Theorem 1.4.1. I 

A priori it is conceivable that there exist other fundamental invariants 
of space and time which would restrict the group even further. We will 
now show that because of the validity of Euclidean geometry and the 
principle of Galileian relativity 1.3.1 this is not the case. 

Proposition 1.4.1, Fix a non-relativistic, inertial observer t o ti 
and consider its associated Newton spacetime (A^, t, r, (•, For 

each X € A let Ex ~ {y e A^ : r{y—x) = 0}. 

(i) There exists a Minkowski metric g which generates the light cones 
as measured by the observer t o + ti. Further, this Minkowski 
metric is unique up to a multiple. 

(a) Let X e A^. The map 4> e f restricted to the Euclidean space 
Ex, *)r-i(o) isometry if and only if (j> ^V. 

(Hi) Let be a subgroup of the Poincare group V such that 

(a) for each Euclidean isometry Eq ^ Eq> there exists acj) eV' 
with ~ 

(b) for each non-relativistic observer t ^ o' ti' with < 0 

there is a (j) with <f>{o + — o' R“^P, 

(c) All <f> £V' preserve the time orientation. 
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Then = 

Proof, (i): The adapted Minkowski metric rj is just given by rj{u^v) = 
—t{u)t{v) -h (u, Observe that r{w) = —7]{t^w) for all w € MP. 

(ii) : This follows directly form Lemma 1.4.2. 

(iii) : We will first show that for any two different vectors Cq, Cq with 

??(eo, eo) — r?(eo, e^) = -1 and 7]{eo, Bq) < 0 there is a Lorentz transfor- 
mation L which leaves span{eo,e/0} and its r?-orthogonal complement 
span{eo, e'O}-^ = {ti; € A : t]{w^€q) ~ rj{w^e^Qf) ~ 0} invariant. There is 
a vector v with \\eo + v an d r]{eo,v ) = 0. Since rj{eo, v) we have 
ri{v,v) > 0 and the vector ei - v/^/qiv^ is well defined. By definition 
it satisfies 7](eo, ei) = 0 and 77(615 ei) = 1. We complete the set of linearly 
independent vectors {eo) 61} to an ?7-orthonormal basis {eo, ei, . . . , e^_i} 
of . Let L be the Lorentz transformation defined by 

Leo = (eo + v)/^\ - = e'o, 

Lei — (ei + |b||R«-ieo)/-^l - ||v|!Rn-i^, and 

Lei = e» Vi € {2, . . . ,n}. 

It maps eo into Cq and leaves the subspaces span{t, ei}, span{ei} (i 6 
2, ... ,n - 1}) invariant. This transformation is called a Lorentz boost. 

We will now show that the group G generated by all Newton 
transformations with respect to the inertial frame (eo,»7(eo, •)) and all 
Poincare transformations (j> of the form <j>{x) = o + L{x—o) where L is 
a Lorentz boost coincides with the group of time orientation pre- 
serving Poincar4 maps. It is clear that G is a subgroup of P+. To show 
the converse Let o,o' € A” and {cq, . . . ,e„_i}, {sq, . . two or- 

thonormal bases with respect to 1 ]. We have to show that the Poincare 
transformation tp which maps o into o' and o -f e* into o' + e'^ for all 
i is a composition of maps in H and G. In each basis there is ex- 
actly one vector e* (respectively, ej) with ri{ek,ek) = -1 (respectively, 
= -1) and ri{ek,i) < 0 (respectively, < 0). We can 

renumber the basis vectors such that = eo, ej = Cq. If eo = e'o 
let ipi be the map x i-> a; -|- (o'— o). If eo ^ e'o then there is a vec- 
tor V € IR” with ri(v,eo) = 0, 0 < r](v,v) < 1, and e'o || eo -f v. In 
this case let ‘ipi be an element of the Newton group with respect to 
non-relativistic inertial frame (eo,??(eo,-)) which maps o to o' and ei to 
^/II^IIt-i(o)- Consider the Poincare transformation 'ip 2 {x) = o + L(x—o) 
where L is Lorentzian boost which maps eo into e'o and leaves the 
plane span(eo,e'0} invariant. Observe that {eo,'i/’i(ei), • . • , V’i(en-i)} 
and {(‘>p 2 )~^(e'o), (ip 2 )~^(e'i), . . . , (ip 2 )~^(e'„_i)} are both orthonormal 
with respect to rj and that 

span{V>i(ei), . . . , V'i(en-i)} = span{(V’ 2 )"He'i), • • • , (V’2)“Hen-i)} 
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= {x : r){x - o,eo) = 0}. 

Hence there is element 'ips of the Newton group with respect the to 
non-relativistic inertial frame (eo,?7(eo, •)) which leaves o' invariant and 
satisfies 'tps ^ — 1, ... - 1). This implies *0 = 

Finally we can show that is the only subgroup of V which satisfies 
(a), (b), (c). 

Let Eo Eo> be an Euclidean isometry. Then i/j exten ds to a 
unique transformation which leaves t invariant, 4>{x) = o) + 

r{x—o)i. Hence the Newton group AT is a sub-group of V'. 

Let o' e and let t' be a vector with Since r(t') = 1 and 7^(1', t') < 0. 
By assumption there is a t/? € P which satisfies (j){o -f R’^t) = o' -f R'^t'. 
In particular, the associated Lorentz transformation maps t into R+t'. 
We have seen above that there are Newton transformations ipz with 
respect to (t, r) such that '0 = '^2 ° ^3 o , where a 'ip 2 is the Poincare 
transformation that corresponds to a Lorentz boost which maps t into 
R'^t' and leaves the spaces span{t, t'}, and its 77 -orthogonal complement 
span{t, t'}^ invariant. Hence ip 2 is the composition of maps provided by 
assumption (a) and (b). Since t' was arbitrary we obtain that V' contains 
all Lorentzian boosts and all elements of the Newton group associated 
with (t,r). Consequently, — P'. I 

Let (t, r) be a non- relativistic reference frame and Eo be a hyperspace in 
spacetime which represents an instant of time. If we assume the axioms of 
Euclidean geometry for Eo then, to be consistent, we have to assume that 
this structure is invariant with respect to any transformation which is 
an isomorphism of our physical structure. Hence (ii) of Proposition 1.4.1 
implies that we must restrict to the Poincare group V. Preservation of 
time orientation reduces this group to By Proposition 1.4.1 (iii), 
the axioms of Euclidean geometry, and Postulate 1.3.1, the Poincare 
group cannot be further reduced. Hence we conclude that space and 
time are well described by a Minkowski spacetime together with a time 
orientation. 

The (arbitrary) non- relativistic inertial frame (t, r) we have started 
with satisfies ? 7 (t, t) = —1. Recall that t was the velocity vector of the 
inertial observer t o + ti who was supposed to be at rest. Since the 
Poincare transformations are the isomorphisms of Minkowski spacetime, 
any other inertial observer ^ o' -h it' who can be supposed to be at rest 
must be linked to t o -h ft by a time orientation preserving Poincare 
transformation If we denote the associated linear transformation by 
L(fj then t' € R*^L^(t) holds. In particular, all admissible observers t' 
must satisfy ? 7 (t', i') < 0. 



Observe that t' L^(t) in general since r which is used to normalise t' is 
not preserved by Lorentz transformations. 
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Definition 1.4.4. Let be a Minkowski spacetime with time ori- 

entation [t;]. 

(i) A (special-relativistic) infinitesimal observer is a vector t € 
with rj{iA) = —1? p{v,i) < 0. 

(a) A (special-relativistic) observer is a curve 7: 1 1-> 7(t) such that 
the velocity vector 'j{t) is a special-relativistic infinitesimal observer 
for all t, 

(Hi) A (special-relativistic) inertial observer is a curve 'j: t ^ x -\- 
ti 6 where x E AA and i is a special-relativistic infinitesimal 
observer. 

For any special-relativistic infinitesimal observer e E^ and any any 
o" G A^ there is a time orientation preserving Poincare transformation 
^ which maps t o + ti to for all t. 

The rest space with respect to (t, r) at the event o G coincides 
with Eo = {x e A'^ : r){x—o^ t) — 0}. Let (j) G such that its associated 
Lorentz transformation maps t to t'. Since <j) maps E into the set E' — 
{x' e A^ : 77(0:'— o', F) = 0} this set must be interpreted as the rest space 
with respect to the special-relativistic inertial observer t o' 4- tt'. In 
general, this space does not coincide with the non-relativistic rest space 
Eo' . Hence we arrive at the following definition. 

Definition 1.4.5, Let i be a special-relativistic infinitesimal observer. 

(i) The infinitesimal rest space with respect to i is the set := 
{w^W^ : 7]{i',w) =0} 

(ii) The affine rest space containing o' G A’^ with respect to i is 
given by d -f- 1'"^ C A^. 

The affine rest space inherits the Euclidean scalar product 
7]{u^v) {v^w G (t0"^)‘ 

Similarly, the time difference Att^ {x, y) of two events x, y with respect 
to the special-relativistic infinitesimal observer t' is given by t where 
t G E is the (unique) number such that x-\-ti' lies in the affine rest space 
of t' which contains y. 

The original non-relativistic observer t i— ^ o -h tt is also a special- 
relativistic observer and for this observer the non-relativistic and special- 
relativistic definitions for rest space with associated Euclidean scalar 
product and time difference coincide. The relativity Postulate 1.3.1 im- 
plies that for every other special- relativistic inertial observer o' -j- ti' 
lengths and angle should be measured by {«, *)(tq-L and time differences 
by 

In conclusion, a Minkowski spacetime together with a time orienta- 
tion contain all the geometric information of spacetime. This geometric 
structure is mathematically simpler and more elegant than the structure 
of a Galilei spacetime. 
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Remark The geometric structure of spacetime was discovered by 

Minkowski (1909). But before him Albert Einstein (1 879-1 955) (Einstein 
1905) had realised that absolute time does not exist and came to an 
equivalent but less elegant description of spacetime. His work contains 
the main physical discovery which justifies to speak of Einstein’s special 
theory of relativity. An important precursor of Einstein was Hendrik 
Antoon Lorentz (1853-1928) whose explanation of the Michelson-Morley 
experiment anticipated the length contraction^^. 

1.4.1 Causality in special relativity 

We start with some terminology which will be justified below. 
Definition 1.4.6. Let (A^,?^) be the Minkowski spacetime. 

(i) A vector w is called spacelike, ifrj{w,w) > 0, timelike ifri{w^w) < 
0, and lightlike (or nullj ifrj{w,w) =0. A vector w is called causal 
if it is timelike or lightlike, 

(ii) Let [t] be a time orientation. A causal vector u is called future 
directed ('past directed^ ifrj{u,i) < 0. 

(Hi) A curve 7 is called spacelike (resp., timelike, lightlike (or null^, 
causal, future directed, past directed ) if all its velocity vectors 7 are 
spacelike (resp.^ timelike, lightlike (or nullj, causal, future directed, 
past directed 

Let u; be a vector and x^y e A'^ with y — x w. If w is spacelike then 
there is an infinitesimal observer t such that r}{Wji) — 0. This implies 
that the events x and y ~ x -\-w lie in the same affine rest space with 
respect to t, in particular, these events are taking place at the same time 
with t. Hence there cannot be any causal process which links a: to y or 
vice versa. 

On the other han d, if w ~ y—x is tim elike and future directed then 
either w/ ^/~r|{w^w) or -w/ ^/-rj{w,w) is an infinitesimal observer. For 
definiteness assume that w/^J~r}{w,w) is in the time orientation. The 
inertial observer t xi-t-j ^f connects x with y. Hence the event x 

y' —r){w^w) 

definitely must take place before the event y. This motivates our causality 
definitions above (see also Postulate 8.0.1). 

This discussion also implies that the field of light cones x serves 

as a causal boundary. 

Corollary 1.4.1. Let {A^.y) be a Minkowski spacetime. 

(i) The set of all events y 6 x which can be causally influenced by 
processes taking place at x are given by J'^{x) := {y e A^ : y can 
be reached from x by a causal, future directed curve} . 
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His interpretation was different from Einstein’s, however. 
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(ii) dj+{x) = c+. 

Above we have seen that the most important ingredient of our discussion 
is the fact that the isomorphisms of spacetime are just the elements of 
the time orientation preserving Poincare group . The main step to 
arrive at is contained in Theorem 1.4.1 in connection with Postu- 
late 1.4.1. One may object that the constancy of the velocity of light is 
a rather awkward postulate. However, it is closely linked to the funda- 
mental notion of causality. The following theorem which has also been 
obtained by Alexandrov (1975) allows to replace the light cone structure 
in our fundamental postulate by the assumption of causality. 



[p. 34 i] 
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Theorem 1.4.2. Let n > 3 and consider the Minkowski space 

Let (/>: be a bijective map such that 4> and both respect 

causality: y € J'^{x) € J'^{(j){x)) for all x^y G A^. 

Then there exist an L £ 0(n, 1), an a 6 R \ {0}, an o £ A’^, and a 
b £W^ such that <f>{x) = aL{x—o) -f b for all x e 

Proof, This theorem is a corollary to Theorem 1.4.1. We only have to 
show that lightlike vectors are mapped into lightlike vectors by (j) and 

r'. 



We will first prove that for y £ ^ y the condition 

rj{y—x^y—x) = 0 is equivalent to the assertion A{x,y) :<^ “for all 
zi^Z 2 € J~^{x) n J~{y) we have either Zi £ J^{z 2 ) or Z 2 £ 

If T]{y—x^y—x) = 0 then J'^{x) O J~(y) is a part of a single light 
ray. Clearly, any two points on a light ray are causally related. If 
ri{x—y,x—y) < 0 then J~^{x) H J~{y) contains an open set U C A^. 
It follows that U must contain points zi , Z 2 which are connected via a 
spacelike vector w. It is clear that ^ and Z 2 ^ 

Now we will prove the theorem. If x—y is lightlike then A(x,y) 
holds. Since A(x^y) is formulated entirely in terms of causal relation- 
ships, A{(p{x), (f>{y)) must also hold. But this is equivalent to 



r]{(f>{x)-^{y),(j){x)-^{y)) = 0. 



Analogously for (j> 



-1 
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It may be philosophically more appealing to demand causality than in- 
variance of the light cones but it should be noted that our original version 
is closer to the actual experiments motivating special relativity. 



1.4.2 Length contraction and time dilatation 

Since there are no absolute time or absolute space it should not come 
at a surprise that lengths in space and time -differen ces are observer- 
dependent. To simplify notation let l|'u;|| — y/r}{w,w) for any spacelike 
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vector w. Fix an event x e and an infinitesimal observer t and con- 
sider a rod which rests in the affine rest space x + of t If with respect 
to the observer its endpoints are given by x and a: -h at a time to, then 
it will sweep out the subset 

S ^ {z : 0 = or + + /it, A e [0, 1], /i € R} 





Pig. 1.4.8. Time dilatation 



in spacetime (cf. Fig. 1.4.7). We complete {t,ei = tl\\i\\ to an orthonor- 
mal basis {t, ei, . . . , Cn-i} of A^, Now consider a second infinitesimal ob- 
server i' who moves (relative to t) with the velocity v = ||u|| 'Ci. Let L be a 
Lorentz transformation which m aps t into t' and leaves span{t, ei} invari- 
ant. We haveF = Lt = (i+u)/^/! - ||u||2, Lei = (ei + ll^||t)/yi - ||uP, 
and Lei ~ Vi € {2, ... ,n - 1}. The rest spaces relative to M are all 
parallel to Li^. In order to determine the length of the rod with respect 
to i' we must calculate the length of 



S' n (x Lt"^) = 



+ q: 



ei + ||i;||t ^ a 




It follows that ||£'|| = v^l — lliip - ||^|| < ||^||. Hence the infinitesimal 
observer t' measures a shorter length than t. This Lorentz contraction 
is one of the reasons why initially many physicists found special general 
relativity hard to understand. It should be remarked that an investiga- 
tion of 3-dimensional objects shows that the Lorentz contraction is more 
like a rotation. In particular, a moving sphere looks like a sphere at rest. 
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There is a similar effect with respect to time, the time dilatation. In 
spacetime, a time interval At with respect to t is given by the set 

T = 4" tt d* ft : t ^ [O, Z\t], ft € j- 

(cf. Fig. 1.4.8). In order to calculate At' we must consider the subset 

of spacetime. It follows that At' ~ < At. This is the 

reason for the twin “paradox^’ : Consider two twins, one of them staying 
at home, the other one travelling with high velocity away, and then, after 
some years coming home. Afterwards the twin who had travelled will be 
younger. Let t be the infinitesimal observer associated with the first twin, 
to be the infinitesimal observer of the second twin at her outward journey 
and ir be the infinitesimal observer of the second twin during her return 
journey. As a consequence of time dilatation, the time lapse between 
outset and return with respect to the travelling twin will be shorter then 
the time lapse with respect to the twin at rest. Hence after her return 
the twin who has travelled will be younger than the other one. If the 
time interval and the involved velocities are large enough the effect can 
be spectacular. 




This effect is purely geometrical and has nothing to do with acceler- 
ation. The fact that the situation is not completely symmetric, i.e. that 
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the twin who had travelled changes her direction and therefore is not an 
inertial observer is only needed to get her back but has nothing to do 
with the effect. To make this point clearer imagine that spacetime would 
be cylindrical. To be concrete, let y be an event such that v = y^—x is 
spacelike and consider the hyperspaces = {z € A : rj{z—x,v) = 0} 
and Fy = {z e A : r}{z—y^v) = 0}. These two hyperspaces enclose a 
region M which is bounded in the direction of ±u. Now assume that 
spacetime is just the set M where and Fy are identified. It can be 
shown that locally it is impossible to differentiate between (M, r/) and 
{A^,rj), (We will not prove this fact. It will become clear in the next 
chapter). If the twins lived in {M^y) instead of (A”',r/), the sister who 
travels would not need to turn back. As soon as she arrives at Fy (or 
Fx) she would (by our identification) be at the other hyperspace and 
therefore on the other side of her sister. Just travelling on she would 
eventually meet her sister again. If n = 2 then M is just an ordinary 
cylinder and it is easy to visualise the whole setup (cf. Fig. 1.4.10). 




Fig. 1.4.10, The twin 
paradox in a cylindrical 
universe 



1.4.3 Relativistic particles and photons 

We give a brief outline of the elementary concepts of photons, par- 
ticles and their collisions. Their non-relativistic analogues have been 
introduced in the section on the non-relativistic theory of particles 
(cf. p 20). The content of this section will be used to motivate def- 
initions in Chap. 5. Part of the physical discussion in Chap. 6 also 
requires this section. 

The following is the direct analogue of Definition 1 .2.3 for a non-relativistic 
particle. 
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Definition 1.4*7* A special-relativistic particle with mass m is a pair 
(m)7) where m E and 7 is a curve satisfying 77(7,7) = —1* The 
curve 7 is called its world line in spacetime. 

A special-relativistic inertial particle is a particle 7 which satisfies 
7 = 0. 

Unlike in the non-relativistic case, we have now only one law which cov- 
ers collisions. Let (mi,7i)i=i,...,/c denote the incoming and (m'«, 7'-)j-=i,...,z 
outgoing particles of a collision. Then conservation of momentum is ex- 
pressed by the single equation 

k i 

rriiji = ^ K'Vj- (1.4.12) 



Choose any infinitesimal observer t and denote the projection to the 
orthogonal complement of t by v ^ v. Then the momentum m'y splits 
into spatial and temporal parts as follows: 



m 

^7 




Conservation of the spatial momentum takes the form 

SL,-". ; 3 



(1.4.13) 



which is a modified form of the non-relativistic conservation law (ii) on 
page 21. Energy conservation takes the form 

(1.4.14) 

i=l j-1 



Since 



m 



— ^ = m + -mlliTlI^ + odltlf ) 

vi - ll^'lr ^ 



we recover as an approximation the non-relativistic law of mass conser- 
vation. If the equality rrii = Y!j=i holds exactly we also recover 
an approximation of the conservation law for kinetic energy. The rela- 
tivistic version of the conservation laws is not only more transparent and 
economical, it also leads to important applications: 



Example I. 4 .L If one bombards lithium '^Li with hydrogen one ob- 
tains helium ^He according to the nuclear reaction 



-b ’’'Li 2 + energy. 
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The weights of 1 mol (i.e, 6.02213 • 10^^) atoms of hydrogen (respec- 
tively, lithium, helium) are l,00783g (respectively, 7.01601g, 4,00260g). 
It follows that the final product is lighter by about 0, 01864g/mol hy- 
drogen atoms. According to the energy conservation theorem above, this 
mass difference corresponds to the energy E ^ 1,864 • 10“^kg • <? ^ 
1,864 - (2.9979) • 10^^ mkg/s 1.67525 • This energy is huge in 

comparison to the amount of material involved. The energy amount of 
an adult is about 8000 kJ per day or 2920000 kJ per year. It follows 
that 1.67525 • 10^^ J would last a human being for more than 500 years. 
Observe that the amount of energy which can be obtained from nuclear 
fusion is huge because the velocity of light c is extremely large. In this 
book we chose natural units where a length unit is defined as c-time unit. 
These units are appropriate to discuss relativistic effects on a theoretical 
level but obscure the fact that velocities in everyday live are negligible 
with respect to c. 

A photon is (classically) characterised by its velocity and its frequency. 
From elementary quantum mechanics one has the relationship E — hv 
where h = 6.62608 • lO^^'^J s is the Planck constant and u the frequency 
of the photon. These quantities uniquely determine the momentum p of 
the photon. Let t be an infinitesimal observer and E = hu be the energy 
of the photon as measured by t. If c is the spatial velocity of the photon 
relative to t then its momentum is completely determined and given by 
p = E{i + c/||c]|). Any other infinitesimal observer P measures the light 
frequency i/' = -rj{i',p)/h. The frequency of visible light light ranges 
from about 4 • 10 oscillations per second (infra red light) to 8 • 10 ^^m 
oscillations per second (ultra violet light). 



[p. 40 1 ] 
i P- 47 





2. Analysis on manifolds 



p. 4, 

Special relativity can only he expected to be a good description locally. 

We will assume that special relativity is an exact SnfinitesimaV’ de~ [j p. 54] 
scriptionf i.e. that it holds as a first order approximation near any 
given point, A rigorous formulation of this idea requires manifolds 
and tensor fields which will be introduced in this chapter. 

This chapter contains much more material than is necessary for 
the understanding of the following Chop. 3. While in an ideal worlds 
all this material would be standard knowledge of mathematicians and 
theoretical physicists, we give a self-contained treatment for those read- 
ers who still have to learn about analysis on manifolds. We will be a 
little more general than necessary. Instead of using the field R we will 
develop the theory for both fields R and C and write K if a statement 
is valid in both cases. This generality is not needed for the main pur- 
pose of the book, i.e., for presenting the theory of spacetime. However, 
both physicists who go on to study gauge theory and mathematicians 
who are interested in differential geometry as a mathematical disci- 
pline will benefit from this generality. It is also instructive to see which 
concepts depend on the real structure. Writing IK instead of R if pos- 
sible will not introduce any additional difficulty. 

While everything presented here will be used somewhere in the 
book, readers primarily interested in space and time may want to skip 
material where possible and come back later to it when needed. The 
minimal amount the reader should know in order to pass on to the 
next chapter is 

1. The definition of manifolds: Sect 2.1 up to Sect. 2.1.1; 

2. The tangent bundle: Sect. 2.2; 

3. Tensors and tensor fields: Sect. 2.3 up to including Definition 
2.3.7, Sect 2.3.2; 

4 . Connections: Sect. 2.6 up to including Definition 2.6.2; 

5. Examples of connections: Sect 2.7. 

There is a conceptional problem with the theory developed so far. Prom 
our local experiments we implicitly extrapolated a structure of spacetime 
which has only been tested in a small part of the small spacetime region 
which is inhabitated by human beings. The Michelson Morley experiment 
only indicates that Minkowski spacetime is nowadays a good description 
of the spacetime structure of a (comparatively small) earth-bound lab- 
oratory. A weak form of the philosophical Copernican principle states 

^ The guide in the margins assumes that the reader has no knowledge of 
analysis on manifolds. 



M. Kriele: LNPm 59, pp. 47 - 149, 1999 
© Springer- Verlag Berlin Heidelberg 1999 





48 



2. Analysis on manifolds 



that our position in spacetime is in no way special. In particular, at any 
other event in spacetime one would observe the same physical laws. Nat- 
ural sciences which go beyond the mere cataloging of phenomena would 
be impossible without adopting this principle. Hence we feel justified to 
extrapolate that 

every event in the universe has a neighbourhood whose space- 

time structure is well described by Minkowski spacetime. 

We will see that this extrapolation is very different from the naive pos- 
tulate that the universe has globally the structure of a Minkowski space- 
time. 

As a first step we will have to find out how to connect our local 
Minkowski spacetimes. To give a simple example which exhibits part of 
the problem consider the torus which can be obtained from via 
the identification x ^ x aei for all a e Z, a; € where 
is a fixed standard basis of . We can equip in a natural way with a 
Minkowski metric induced by A^. While locally there is no possibility to 
differentiate between and both spaces are globally very 

different (cf. Fig. 2.0.1). 




We will now develop the mathematical techniques needed to globalise 
the structure given by our collection of Minkowski spacetimes. First note 
that we cannot even expect that locally Minkowski spacetime is an exact 
description. But it is reasonable to expect that Minkowski spacetime 
is the better an approximation the smaller the subset of events we are 
considering is. This means that we will have to formulate the theory 
infinitesimally. 

In Sect. 2.1 we will generalise (part of) the structure of A^ to man- 
ifolds which may be thought of as a collection of local A’^s. We will 
then construct tangent spaces to a manifold which are the infinitesimal 
approximations of it (Sect. 2.2). 



2.1 Manifolds 

In this section we will localise part of the structure of and lay the 
foundation for calculus. 
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One of the most important structures of is given by the collection 
of all open subsets, because this collection is needed when one defines 
limits, the most basic notion of analysis. Since can be thought of as 
with the special properties of the vector 0 been forgotten about there 
is a straightforward way to define open sets in A^. The subset W C A^ is 
open if and only if there is a point o € A and an open set U CW^ such 
that 14 = V V ^ 

We will now localise the topological structure of A^, i.e., the part of 
the structure of A^ which tells us which subsets of A’^ are open. 

Definition 2.1.1. A topological space (M, r) is a set M together with 
a collection r of subsets of M which satisfies the following properties, 

(i) 0 e r, M e r; 

(ii) U,V er^UnV er; 

(in) if A is an index set andUa G r for each a € A then ^ 

The collection r is called the topology of M, A set U C M is open if 
U ^ T and is closed if M \ U £ r, A setV C M is a neighbourhood of 
a point X £ M if there is anU £r with x eU cV, 

It is clear that the collection of open sets of (and therefore also of A^) 
satisfies properties (i)-(iii) of Definition 2.1.1. This justifies the definition 
of a topological space. On the other hand, a general topological space 
may have properties which are quite pathological. This can be seen from 
the following two extremal examples. Let M be any set and define Tfine 
to be the set of all subsets of M and Tcoarse “ {0, M}. Then (M,Tfine) 
and (M, Tcoarse) topological spaces. 

Definitions which can be stated purely in terms of open sets carry 
over to topological spaces. 

Definition 2.1.2. Let (M,r) and(M,f) he topological spaces. 

(i) A map f:M M is called continuous if f~^{U) e r for all 
U £f. A bijective, continuous map whose inverse is also continuous 
is called a homeomorphism. 

(ii) A subset U £ M is compact if for every collection of sets {Ua]a€A 

withUa £ T andU C there are finitely manyUa(i)j . . >L(a(k) 

withU 

(Hi) A topological space (M, r) is connected ifU^ V £ r withUOV = 
0 and U U V = M are necessarily of the form U = M, V = 0 or 
V ~ M, U = 0, For the topological spaces we are interested in 
(cf. Definition 2.1.4 below) this is equivalent to the requirement that 
any two points can be connected by a continuous curve [0, 1] — + M. 
(iv) A collection of open subsets {lAa}a€A is a basis of the topology 
r if for every open set V there is a subset B C A with 
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(v) A subset U dV is called dense ifV ~U where U is the closure 
ofU, i.e,, the smallest closed set containing U . 

(vi) A collection of open subsets {Ua}a€A is a sub-basis if all finite 
intersections of sets Ua form a basis of the topology r. 

Lemma 2.1.1. Let M be a set and {Ua)a^A be any collection of sub- 
sets of M which satisfies [Ja^A^ci = M. Then there is uniquely defined 
topology T of M such that {Ua}aeA together with the empty set 0 are a 
sub-basis ofr. 

Proof This follows immediately from the definition of a topology, I 

We can now describe those topological spaces which cannot be locally 
distinguished from A^. Let (M, r) be a topological space which is Haus- 
dorjf, i.e has the property that for any two different points x^y e M 
there are neighbourhoods ^ of a;, V of y which satisfy ^ fl V = 0. The 
topological space (M,r) is locally indistinguishable from (considered 
as a topological space) if each x G M has an open neighbourhood U 
such that there is a homeomorphism (p: U V C The pair (U,(p) 
is called a topological chart of M. Since for each open subset U CU the 
restriction oi p to U is also a homeomorphism onto its image, we have 
the following compatibility property. 

Let (W, ip) and (W, (p) be topological charts of (M, r) with UdU 
0. Then the map (p o ip ~^ : ip{U) n (p(U) -4 ip[u) fl (p(l() is a 
homeomorphism. 

We would like to have not only a topological structure on M but also 
a structure which allows us to use the tools of analysis. Unfortunately, 
there is not an independent definition of a “differentiable space” which 
is analogous to the definition of a topological space. To get an idea how 
this difficulty can be overcome we can view the charts (W, ip) as a way to 
induce the local topological structure of on M. To be more precise, 
let M be any set, be a set of subsets of M with ~ 

M, and ipi'. Ui Vi bijective maps onto open subsets of R’^. We can 
now attempt to define a topology on M by using the sets {q^f^iyVi) : 
Wi C Ui is open} as a sub-basis for the topology of M, In order to get a 
topology consistent with a local description we have now also to demand 
the compatibility property above. Still, the resulting topological space 
could fail to be Hausdorff (cf. Fig. 2.1.1). Since this is a local property, we 
will demand it in addition. We have now defined a topological structure 
on M which is locally indistinguishable from the topological structure of 
MP. This definition can be carried over to the differentiable structure. 



^ Recall from the definition of the topology of A’^ that the map ^ , 

X X — o is a, homeomorphism. 
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identify 



Fig» 2.1.1, A topological space 
which is locally homeomorphic to 
M but fails to be Hausdorff 



Definition 2.1.3. Let I be an index set and (M, r) be a topological space 
which is Hausdorff, A atlas of M is a collection <Pi \ Ui <Z M 
(i e I) of local homeomorphisms such that 

(i) each Ui is open and connected, 

(ii) each x € M is is contained in some Ui, 

(Hi) for each i,j with Ui nUj ^ ^ the map o : (fj(fUi C\Uj) 
(fi{Ui OUj) is a -diffeomorphism. 

The pairs (Ui,(pi) are called charts of M, A chart (Ui,cpi) is centered at 
X € M ifx € Ui and (pi{x) = 0. Two charts {Uai To) {o^ = 1, 2) are called 
compatible if they satisfy the compatibility condition (Hi), Two atlasses 
are compatible if each pair of charts in their union is compatible, A C^- 
atlas A is called maximal if any -atlas containing A coincides with 
A, 

Remark 2.1,1, In the case K — C every (7^-atlas (fc > 1) has the prop- 
erty that (pi o is analytic, i.e., is locally given by its Taylor series. 

This follows immediately from the fact that C-diflFerentiable functions 
are analytic. 

For technical reasons (cf. Sect. 2.1.2) we will also demand that the topol- 
ogy of M has a countable basis. This means that there are countably 
many sets such that any open set W is the union of sets Vi, 

Definition 2.1.4. Let (M, r) be a connected topological space which is 
Hausdorff and which has a countable basis. (M, r) together with a max- 
imal -atlas is called a C^-manifold. A -manifold is also called 
a smooth manifold. We will often refer to smooth manifolds simply as 
manifolds. 

A subset N C M is an m-dimensional submanifold of M if for each 
X £ N there is a chart {U,(p) of M centered at x such that (p{N DU) — 
(p{U) n {y e = 0}. An (n - 1)- dimensional 

submanifold is often called a hypersurface. 

Observe that a subset N C M can be a manifold without being a sub- 
manifold of M (cf. Fig. 2.1.2). 

The following lemma guarantees that a manifold is determined by any 
(not necessarily maximal) C^-atlas (fc > 1) which is compatible with the 
given maximal C^-atlas. In practice, it is therefore sufficient to work 
with any given atlas. It can be shown (Hirsch 1976) that any maximal 
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Fig. 2.1.2. A manifold M C which is 
not a submanifold of 



C^-atlas contains a subatlas which is C^. Hence for most questions it 
is no loss of generality to consider only smooth (i.e., manifolds.^ 
On the other hand, it should be noted that C^-manifolds are really 
more general We will not consider such manifolds in this book. 

Lemma 2.1.2. Let A he a -’atlas. Then there is a unique^ maximal 
-atlas containing A. 

Proof. Let B be the set of C^~charts which are compatible which each 
chart in A. Clearly, A c B, Any two charts (Vi,V^i) and (V 2 ,^ 2 ) in B 
are compatible. To see this let a: G Vi O V 2 and (ZY, (p) be a chart in A 
with X £ti. Then the maps ip o and ^ip 2 ^ in the open 

set n Vi n V 2 ). It follows by composition that -02 ° is also 

C^. That o is can be shown in the same way. It remains 

to prove that B is maximal and unique. The first assertion follows from 
the definition of B. Assume that B' is another atlas containing A. Since 
every chart of S' is compatible with each chart of A, it must belong to 
B by the definition of B. Hence B^ C B and the second assertion follows 
as well I 



Example 2.1.1. Consider the cylinder which can be obtained by identi- 
fying opposite sides of the rectangle [ai, 61] x (a2, 52), 

{(ai,y) : y € [a2,M} ~ {(^i>2/) : V ^ [a2,b2]}- 

As charts we can take the maps 



(fi: ([ai,6i] X {a 2 ,b 2 ))\ ^ (^2,62) 

|(x- (62 -ci2),2/) forx>^i^ 

¥^2; X (02,^2) 

(x,y) i-> (x,y). 






^ It can in fact be shown that there is always an analytic sub atlas. However, 
it is not a good idea to restrict to analytic atlasses because then some 
important technical tools do not work (cf. Sect. 2.1.2.) 
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Consider a (long) rectangular strip of paper. The two shorter opposite 
edges can be glued together in two ways. Either one obtains a cylin- 
der (cf. Example 2.1.1) or a figure which looks like a cylinder with a 
twist. This Mohius band can mathematically be constructed as follows 
(cf. Fig. 2.1.3). 



identify 




identify with flip 

Fig. 2.1.3. The construction of a Mobius band 



Example 2A.2 (Mobius band). Let Vi — (0,2a) x (— 6 , 6 ) C and V 2 = 
(—2a, 0) X (— 6 , 6 ) C R^. We define on Vi U V 2 an equivalence relation ~ 
by 

- 3a, a;^) if € (a, 2 a) x (- 6 , 6 ), 

(x^,x'^)r^(x^ - a, -a:^) if (x^,x^) G ( 0 ,a) x (- 6 , 6 ) 

and the manifold M by M = (Vi U V 2 )/^. Denote the canonical pro- 
jection Vi U V 2 — ^ M, (a;^,a:^) i-> [(x^.x^^)] by tt and let Ui = 7 r“^(Vi), 
(^ ^ {^>2}). Then {{Ui^ipi), (i^ 2 ,<^ 2 )} is an atlas of M. 

Since manifolds have a differentiable structure we can define differen- 
tiable maps on manifolds. 

Definition 2.1,5. Let M,N be -manifolds and k <L A map f: M — > 
N is C^-differentiable if for every two charts {Ua^^a)) (^ 5 ,^ 5 ) of M^N 
the composed map '06 ° ° (^)~^ i^s a C^-map. A -differentiable map 
is called smooth. The set of all -differentiable maps from M to N is 
denoted by C^{M, N). 

The maps g: M and h: M are C^- differentiable if 

for every chart {Ua, 99 ^) the composed maps g o ((/?a)“^ : and 

<pa ^ 9' are -differentiable. The set of all -differentiable 

maps from M to and from to M are denoted by C^(M,K^) and 
C^(K^,M). 
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It is easy to see that this definition is satisfied if / is with respect 
to any given atlas. For M = the definition coincides with 

the usual definition of differentiability in elementary analysis. 

Recall from analysis that a continuously differentiable map F : — >■ 

has rank r at x 6 if the subspace DF{x)W^ C KF' has the 
dimension r. 

Let /: M be a C^-differentiable map and x € M. li {Ui(pi)^ 
{^2,^2) are charts centered at x and (V2,V^2) charts centered at 

f{x), then the rank of the maps o f o (pi~^ at (cc) and f ^^2'^^ 
at <p2{x) coincide. We can therefore speak of the rank of / at a; and the 
following definition is independent of the chosen charts. 



Definition 2.1,6« Let f \ M ^ N be a -differentiable map and x G 
M. Let (W, 99) be (any) chart of M centered at x and (V,^) be (any) 
chart of N centered at f{x). The map f 

(i) has rank r at x if 'ijj o f o has rank r at (p{x), 

(ii) is an immersion at x ifD(ipofo(p~~^) is an injective linear map 
at ^{x)f 

(Hi) is an submersion at x if D^'tp o f o i$ a surjective linear 
map at at (fix), 

(iv) is an local diffeomorphism at x if D(^ 0/0 is a bijective 
linear map at (p{x). 

Lemma 2*1.3* Let f:M-^Nbea -differentiable map of rank r at 
X e M. Then there is a neighbourhood W of x such that for each y €W 
the map f has rank ry >r at y. 

In particular, if f is an immersion (respectively, a submersion, a 
local diffeomorphism at x) then it is also an immersion ( respectively, a 
submersion, a local diffeomorphism) at any y € W. 



[p. 47 i] 
i P. 61 



Proof Since D{'ipofoip~'^) is continous the existence of r linearly indepen- 
dent vectors in fo(p~'^)[(p(x))KP' implies that for y close enough to 
X there are also r linearly independent vectors in {Di/;ofo(p~'^)(ip(y))K^. 
Hence the rank of f cannot fall in a sufficiently small neighbourhood of 
a given point. For the second statement observe that immersions, sub- 
mersions, and local diffeomorphisms all have maximal rank. I 



2 .1.1 Construction of manifolds 

In analysis, the inverse function theorem plays a fundamental role 
because it allows to draw local conclusions from infinitesimal assump- 
tions. In this section we show that the inverse function theorem and 
also similar theorems carry over to manifolds. A special case (Propo- 
sition 2,1.1) allows a construction of submanifolds without specifying 
an Atlas. 
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We will occasionally use the results of this section. But the reader 
who is not primarily interested in analysis on manifolds is advised to 
skip this section and to return to it when needed. 

The following lemma is a consequence of the inverse function theorem. 

Lemma 2.1.4. Let U C V C be open, x eU, and / : W V 6e 
a continuous, differentiable map with constant rank r in U. 

Then there exist 

(i) an open neigbourhood U cU of x, 

(ii) a homeomorphism U {y eK^ : \y\ < 1}, 

(Hi) an open neighbourhood V D /(W) of f{x), 

(iv) and a homeomorphism 'ip: {z e : |2:| < 1} V 

such that f ^ 'ip opr of) where Pr{y^, ...,y^) = (y ^, ... 0, ... ,0). 

Proof. Let E he the (n — r)-dimensional vector space E — {v E MP : 
Df{x)v = 0} and F C be an r-dimensional vector space with = 
E®F. Let {e^+i, . . . , Cn} be a basis of E and {ei, . . . , e^.} be a basis of 
For each y E let Ag(y) be the ith. component of y with respect to the 
basis {ei, . . . , Cn}. The vectors fi = Df{x)ei (i E {1,..., r}) are a basis 
of the r-dimensional vector space Df{x)K^ C IK^. We choose linearly 
independent vectors /r+i, . . . /m such that {/i, . . . , f^} is a basis of KF. 
For z e let p'^fiz) be the ith component of with respect to this 
basis. We define 




The map 

g:U^ K"", y ^ po f{y) + \{y) 

is continuously differentiable and I)g{x) : is invertible. Hence 

by the inverse function theorem there is a neighbourhood U C U of x 
such that ^ is a diffeomorphism from U onto the open set gihl). There 
is an e > 0 such that B^{x) \= {z E : \z - g{x)\ < e} C g(U). We 
define U := g~^{B^{x)) and the bijective map 

4>:U^ Bj{0) C K”, -^ig{y) - g{x)). 

^ This numeration of basis vectors of F 0 F may seem slightly odd but will 
prove to be more practical later on in the proof. 
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The rank of / is constant on U which implies that dim(D/(y)K^) = r 
for all y ^ U. Since Dy(y) is bijective for dl\ y ^ U and I)g{y){v) = 
fj>(Df{y)v) for aWv e F the maps D/(y) : F D/(y)F and 

li: Bf{y)F r’ 0 {0} C 0 

are both bijective. Let 0 {0} D/(y)F be the inverse to the 

latter map. 

We will now show that the map h = f o cj)~^ does not depend on the 
variables , y'^- We write 0 and v = vi ® V 2 for 

any v e K”. Since f{y) = o f{y) 0 ^X{y) -\no f{x) 0 7A(x)) we 
have 



tDf{y)v = ° M ° D/(2/)^ 

+ D2/l|i^o/(j/)e jA(y)-iM°/(»)©7’^(®) ° 

Inserting D/(y) = UyO fio Df{y) into this equation implies 

^‘^^\~fiof{y)^^\(y)~^fj,of(x)^^X{x) ^ ~ <Jy O jj, O D/ (y)u, 

where ay is the linear map given by 

ay: K^0{O} y ^ ei^y -I)ih^i^^f^y>^^ix{y)-^fiof{x)^^x{x)’ 

Since y 7M^/(?/)® e^(^) invertible and A maps onto {O}0K^"'^ 
we have only to show that dy — 0 in order to prove D 2/1 = 0. For v € F 
we have A(r;) = 0 and and therefore ay o fj, o Df{y)v = 0. The map 
yboDf coincided on F with Dy(y) and is therefore bijective. In particular, 
yt o DfF = K’’ 0 {0} which is the domain of ay. Hence ay vanishes. 

Since we have proven that h{y) does not depend on • • - y^ we 
can write h(yi) instead of h(yi 0 y 2 ). 

We identify with 0 and write w — w\ ^ wz. Let 

{^1? • • • ? be the canonical basis of and let r: ^ 

span{/r+i,. . . ,/m} be the linear map which is defined by r{bi) — fi. 
We define the map ip by 

Ip: Bi{0) C -> zi^zs^ h{zi 0 ji{f{x))) 0 rfys). 

This implies 

-tpopro 4>{y) ^'tpo pr{-^g{y) - -^g{x)) 

= ij)op^{-lj,o f{y) - -pof{x) + 0(y) - jA(x)) 

= Iv' ° PrilJ' ° f{y) - M ° f{x)) 
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= O f{y) -1X0 f(x) + M(/(a:))) = ° /(y))- 

Denote the projection of /i to the span of the first r canonical basis 
vectors of by /I. Then we have h o jl = h o fi. Equation T>2h = 0 
implies f{y) = KlKfiv))) which in turn gives i)op^o cj>{y) = f{y). 

We still have to show that -0 is a homeomorphism. But this follows 
since both h and r are homeomorphisms. ■ 



Corollary 2.1.1* Let Mhe an n- dimensional, N be an m-dimensional 
-manifold, and f : M ^ N be a -differentiable may which has con- 
stant rank in a neighbourhood of x G M. Then there exist charts {U,(p), 
(V,^) centered at x, f{x) such that (p{x) = 0, 'ip{f{x)) = 0, and 

Proposition 2.1.1. LetMbe an n- dimensional, N be an m-dimensional 
-manifold, and f:M~^N be a -differentiable may which has con- 
stant rank in a neighbourhood of x € M. Let y € f{M). Then the set 
f~^{y) ® closed (n ~ r) -dimensional submanifold of M. 

Proof Let x e /“^(y) and take charts as given by Corollary 2.1.1. Then 
we have 



¥’(Wn/ '^{y)) = (p{U)n{‘ipofo(p ^(o) 

= if{U) n { 2 : e K” : z^ = --- = z^ = 0}. 

The assertion follows directly from the Definition 2.1.4 of a submanifold. 



Proposition 2.1.1 is a powerful tool which is used to construct manifolds. 

Example 2,1,3. Consider the submersion /: \ {0} — > R, x |xp. 

Proposition 2.1.1 implies that the sphere of radius p > 0 in M which 
coincides with the set is an (n - l)-dimensional submanifold of 

R’^. The reader try for her- or himself to write down directly an atlas of 
the sphere of radius p. It is much more work. 



2.1.2 Partition of unity 

In this section we provide a method of localisation using functions 
which is practical if one wants to define a global object using charts. 
The prime example is the definition of integration in Sect. 2.5.4. This 
method works only for real manifolds: K = R. 

This subsection is somewhat technical The reader may therefore 
want to skim (or skip) this section on first reading and to return to 
its proofs when the results of this section are needed. 
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The aim of this section is to construct an atlas and for each chart a 
function with support in this chart such that 

(i) at any point there are only finitely many charts which intersect 

(ii) the sum of all functions (which is well defined by (i)) equals 1. 

We start with two topological lemmas. 

Lemma 2.1.5. Let M be a manifold. If M is not compact then there 
exists a sequence of open sets {W^}iGN “with compact closure such that 
WiCWi+i and[j,^^Wi^M. 

Pro^. Let be a countable basis of the topology of M such that 

all lAi are compact. Let Wi U\. Since the closure of this set is compact 
there is a fci G N with Wi C W 2 * We proceed now by in- 

duction. Assume, we have already constructed Wi, . . . , Wp, where Wp = 
Uti Then there is a fcp+i > kp such that Wp C — • Wp+i. 



Lemma 2.1.6. Let M be a manifold. And {Ua]a^A be open sets which 
cover all of M. Then there exists a countable collection of open 

sets such that 

(i) each Vj lies in some Ua and has compact closure^ 

(ii) each Vj intersects only finitely many Vi. 

(in) 

Proof. We will first show that we can restrict to the case that A is 
countable. Let O ~ be a countable basis and let {(Vc, <^c)}ceC be 

an atlas of M. For each Vc there are countably many sets Oi^c ^ O such 
that Vc “ USi P^i,c- Since O is countable so is the set C O. 

Let be a re-numbering of these sets and choose for each j G N an 
index c{j) € C with c Vc{j)- Then the collection 9^c(j))}^'GN is a 
countable atlas of M . Since is homeomorphic to there is for each 
j a dense sequence of points in OT For each (i, j) € N x N let 

a(t, j € A) be an index with Xij G Then the countable subset 

{Ha(i,j)}hjef^ of \fAa\aeA covers all of M. It follows that we can assume 
without loss of generality that A is countable. 

Let Wo = 0 and let {W^}jeN be the sequence of sets constructed 
in Lemma 2 . 1 . 5 . The set Wk^i \ Wk is compact and can be covered 
by finitely many for every fc G N. We set Vk,i = 

P^ai(k) Fi (Wfc+2 \ W/c-i) which defines a countable family of sets since 
N X N is countable. Property (i) is clear from the definition of Vk,i and 
property (iii) follows from the fact that the sets Wk+i \ Wk cover all of 
M. Finally, property (ii) follows because each set Wk is only intersected 
by finitely many Vk,r ® 
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Lemma 2.1.7* Let M be a real manifold andU^ V open sets withU C V. 
Then there is a -function /i: M — ^ [0, 1] with 

(i) h{x) — 1 for all x eU, 

(ii) h{x) = 0 for all x £ M \ V . 




Fig. 2.1,4. The proof of Lemma 2.1.7 



Proof. We consider first the special case of two balls with origin 0 e 
but different radii: Let 0 < n < r2 and Br^{0) = {a: G R^ : |rrl < ri}, 
Br^iO) = {a: G R^ : \x\ < r2}. The map 



9ri,r2 • 



R+ ^ R+, 




for ri < t < T2, 
otherwise 



is and has the support [ri,r2]. It follows that 



9ri,r2i^) ~ 1 



Ir, gri,ri,(s)d3 

/r? 9ruv,{s)ds 



is also The properties 



§ri,r 2 (t) = I for ^11 i ^ [0,ri], 

^n,r2(0 ^ (0^1) for all t G (ri,r2), 

§n,r>2(^) = 0 for all t G [f2, oo) 

imply that the (7°^-function 

Ki,T2 : [0, 1] ^ R'*’, X 9rur2iM\) 

is well defined and satisfies supp(h^j,r^2) = ^7.2(0), h^i,r2(^) = I for all 
xeBrT,{0). _ 

Consider now open sets U, V with U C V and let be a 

collection of open sets as provided by Lemma 2.1.6. For each j we will 
now construct a smooth function hj which satisfies 
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(i) hj{x) e [0, 1 ] for all x £ M, 

(ii) hj{x) = 1 for all a: E W n Vj, 

(iii) snpp(hj) C V n Vj. 

Let X e U CiVj and (fx) be a chart centered at x with (px{x) = 0 

and Wx C V DVj. There are positive numbers ri{x) < r 2 {x) such that 
Br^(x){0) C (px(Wx)^ The map 



hi: W,^R+, 



y ^ 



^ri{x) ,T2 {x) ^ ^x{y) 
0 



for y £Wx 
otherwise 



is well defined and smooth. Since U fl Vj is compact there exist finitely 
many points xi, . . . , .Tjv such that the open sets 

{.^Xk {Bri{xk){^))}k~ly..,K 



cover UnVj, Hence the map hj{y) 1 - n^i(l “ H(y)) is also well 
defined and smooth. Clearly we ha ve hj{y) e [0, 1 ] for all y e M. Since 
fixkj y) = Q for all 2 / E M \ V n Vj we have also hj(y) — 0 for all y E 
M \ V n Vj. If 2/ E W n Vj then there exists an xi with (y) = 1 which 
implies hj{y) — 1. 

The function h{x) := ^j(^) is well defined since for each x all 

but finitely many hj{x) vanish, h satisfies supp(h) C V, j{x) > 0 for all 
ir E M, and h{x) > 1 for all x e hi. Hence in order to prove the lemma we 
only need to normalise h appropriately. Let U = {x E M : h{x) < 1/2}. 
This set is open and its closure is contained in M\W. Hence by the same 
construction there exists another smooth function h with h{x) > 0 for 
all a: E M, h{x) > 1 for all x e U, and supp(h) C M\U. Observe that 
h and h do not vanish both at any given point. Hence 



h{x) = 



h{x) 

h{x) + h{x) 



is well defined and smooth. Further, h{x) E [0, 1] for all a: E M. If a: E ZY 
then h{x) > 1 and therefore h{x) — 0 which in turn implies h{x) = 1. If 
x £ M \ V then h{x) =0 and therefore h{x) — 0. I 



Definition 2.1.7, A smooth partition of unity is a set of functions 
{fa - M [0, l]}ae^ such that 

(i) each point a: E M has a neighbourhood which is only intersected 
by the support of finitely many fa, 

(ii) I2a€A /a(^) = 1 

A partition of unity is subordinate to an open covering {Us\beB ‘If for 
every a E A there is a b e B with supp(/a) C 
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Theorem 2.1.1 (Existence of a partition of unity). If {Ut}b€B is 
an open covering of a real manifold M then there exists a countable 
partition of unity {fj}jeN which is subordinate to {Ub}beB’ 

Proof For each x e M let b{x) be an index with x e Ub{x) and U^x) be a 

neighbourhood of x with Ub{x) C Ub{x)- Lemma 2.1.6 implies that there 
exists a sequence and a countable collection of open sets {Vj} 

such that 

(i) each Vj lies in some and therefore Vj C 

(ii) each Vj intersects only finitely many V^; 

(iii) M = 

We can now apply the same argument to the collection of open sets Vj 
to find a countable collection of open sets such that 

(i) each Wk lies in some V^-(fc); 

(ii) each Wk intersects only finitely many W;; 

(iii) M = [Jken^k. 

By Lemma 2.1.7 there is for each k a function h fc: M [0,1] with 
hk{x) = 1 for all x £Wk, hk{x) = 0 for all a; 6 M \ Vj(/c). Since each Vj 
intersects at most finitely many Vi for each x e M we have hk(x) = 0 for 
all but finitely many k. This implies that the sum x ^ f^k{x) is well 

defined and smooth. Since each x lies in some Wk we have hk{x) > 
1 for ell X e M. Thus 

is well defined and smooth. Property (i) of Definition 2.1.7 is satisfied 
because each Vj intersects only finitely many Vi and supp(/fe) C Vj(/c). 
Property (ii) follows directly from the definition of fk^ Hence {fk}keN is 
a partition of unity That it is subordinate to the open covering {Ub}beB 
follows from supp(/fc) C Vj{k) C Ub(x^k))' * 



Remark 2 A. 2, In Lemma 2.1.7 and Theorem 2.1.1 it was necessary to 
restrict to the case IK = M. If K == C, both results are wrong in gem 
eral. This is so because complex- differentiable maps are automatically 
analytic, i.e., can locally be written as a power series. This would be 
impossible for the function h in Lemma 2.1.7. 



2.2 Vector bundles and the tangent bundle 

In ordinary calculus, the derivative D/of a map / : K”" — ^ is the lin- 

ear approximation of /, i.e., it is defined by f{x) = f{a) + Dfaix—a) + 
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o(\x - a|) where o{\x - a\)/\x - a| -> 0 (a: ^ a). To study the linear ap- 
proximation of a map rather than the map itself is certainly one of the 
most powerful approaches in mathematics and physics. Because of the 
limit a: — a — > 0 this approach is often referred to as working infinites- 
imally. Until the middle of this century people spoke of infinitesimal 
(or infinitely small) displacements (meaning the vector x—a if it was 
‘small’). This terminology can lead to misunderstandings but stresses 
the main idea of analysis. While we will give a modern presentation, it 
is a good idea to keep the ‘infinitesimal way of t hinkin g’ in mind. 

The definition of a linear approximation of a function / rests on the 
linear structure of K". In the general case of a manifold, such a structure 
is not at hand. But it is possible to define a linear approximation of a 
map in two steps. First, we linearise the manifold itself. This gives rise 

^to the the tangent space TaM at a point a of a manifold M. Then we 

linearise the map thereby obtaining a linear map Taf : TaM Tna)N 
1 p. 63 I between (linear) tangent spaces. 

We will linearise the manifold M by attaching an n-dimensional vec- 
tor space to M at each point x e M, At first one may think that it is 
sufficient to consider the product M xK^ and to define T^M = {x} x K^. 
However, this would introduce a global structure via the (global) prod- 
uct X. In order to keep within the spirit of localisation, we can only 
demand that such a product exists locally. This motivates the following 
definition. 

Definition 2*2.1. A /c-vector bundle (£, tte, M) over an n- dimensional 
manifold M is a triple consisting 0 / a (fc + n)- dimensional manifold E, 
and submersion tte- E M such that 

(i) For each x € M is {7 te)~^{x) a k- dimensional vector space over 
K, 

(a) for each x e M there is a neighbourhood U and a diffeomorphism 
U X (7te)-1(W), 

where for each y £U the restricted map 

is a linear isomorphism. 

M is called the base manifold^ tte the projection, Ey := {'KE)~^{y) 
the fibre over y, E the total space, and 'ip the bundle chart or local 
trivialisation. 

^ The set of tangent spaces of a Manifold forms again a manifold of a special 
type, a vector bundle. While we will construct many special vector bundles 
and general vector bundles are of importance in gauge theory, their general 
definition is not central to our discussion. 
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We will often just speak of the vector bundle E instead of (E, M), 

Notice that this is just a localisation of M x K^. We call (J5,7T£;, M) 
trivial if there exists a local trivialisation of the form if): M xK^ ^ E. 
In this case, E can be identified with M x K^. An example of a vector 
bundle which is not trivial is given by the Mobius band. 

Example 2,2,1 (Mobius band, continued from page 53)), The Mobius 
band M is also a vector bundle. The bundle projection is given by 
^ Pi ^ ^i{^) where pi: R^ is the projection 

(a:^, 0) and i € {1, 2} is an index with x eUi. It is clear that 
this vector bundle is not trivial. 

Definition 2.2.2. Let E be a vector bundle, A map a: U C M E 
which satisfies nsio-ix) = x for all x eU is called a section of the vector 
bundle E, 

A collection {<ti, , ,ak} of sections such that 
span{(Ti(a ;), . , . ,ak{x)} Ex 
for all x £U is called a frame of E, 

The following definition will play a role later on (cf. Theorem of Frobe- 
nius 2.5.3). 

Definition 2.2.3. Let be a vector bundle, A vector subbun- 

dle {F, 7 Tp,N) of the vector bundle E consists of a submanifold F of E 
and submanifold N of M such that 

(i) TTp := (7 te)\f F N defines a vector bundle structure with base 
manifold N, 

(a) Fy is a vector subspace of Ey for all y £ N, 



2.2.1 Construction of the tangent bundle 



In affine geometry we had distinguished between points x £ and 
translations (or vectors) in the associated vector space A translation 
u: X i— > a; + u is a global concept. Notice that a translation is originally 
thought of as moving the point x along the curve 7^: [0, 1] 1— > t t— > 

X tv to its endpoint. The velocity vector of the curve is v which may 
be regarded as the infinitesimal (but in this case exact) approximation 
of 7v. Given an arbitrary smooth curve 7: (a, 6] A^, t t— > j{t), we 

take its derviative as its infinitesimal approximation at a given point 
X = 7(to). Taking all these velocity vectors at x we obtain a vector space 
TxA^ which is attached to A”' at rr. This vector space is in a natural 
way isomorphic to the associated vector space given by the translations. 
In the following we will transfer these ideas to manifolds. The main 
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difficulty we have to solve is the lack of an associated vector space in 
which we can take the derivative of a curve. 

Let M be a smooth manifold, x e M and 7: (-e, e) M be a 
smooth curve in M which passes through x at parameter value 0. Another 
smooth curve 7: (-e,e) with x = 7(0) is called x-equivalent to 7 if 
i ((^07)10 — ^ (¥^°7)|o- This definition is independent of the chosen 
chart and puts all curves through x with the same velocity into one 
equivalence class [7x]. 

Definition 2.2.4. Let M be a smooth manifold and x e M. The space 
of all equivalence classes Ylx] is called the tangent space of M at x and 
denoted by T^M, Its elements are called tangent vectors. 

We must show that T^M has indeed the structure of a vector space. 
Choose any chart {U^ ip) centered at x. This chart induces a bijective 
map 

which we can use to induce on TxM the vector space structure of K^. 
Let a eK and [7^], [nx] £ TxM, Then we define 



and 



[7x] + [Mx] := {0tr^ (e^([7x]) + 0r([7x])) . 

This vector space structure is independent of the chosen chart. In fact, 
let (V,'0) be another chart centered at x and denote 0^ o by 

\ Then we have 

(0^)-7a0^([7.D) 

= (0^)-' o 0t o (0^)-i (a0^ o (0^)-i o 0^([7.])) 

= (0^)-7a0^([7.])) 
and analogously 

= (etr^oOt o (0^)-^(0^ o (0^)-i o.0^([7,]) 

+ oto{etr^oet{M) 

= o [iop^-'r^oetib.]) 
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= {etY[0t{b.]) + 9tM)- 

We call the set TM UxeMTxM the tangent bundle of the manifold 
M. The vector spaces TxM can be understood as infinitesimal models [p. 63 i] 
of M at X, I i p. 65 

Proposition 2.2.1. Let M be a smooth manifold. There is a natural 
vector bundle {TM, M) associated with M such that {tvtm)~^{^) = 

TxM for all x e M, 

Proof, Let TM = [JxeM'^^^ define 7 rrM([ 7 a;]) = oo. We choose an 
atlas {{Ua,<Pa)}aeA of M and define the structure of a smooth manifold 
on M through the atlas {{{ 7 TTM)~^{L(a)^^a)} where 



■■ (tTTm) ^(K) K” © K", [ 7 x] l-> (Pa{x) © 6 >^“ ( 7 [x]). 



The bijections bijections 

V>a: Wa X K” (nTMYi^a), (x,v) (0'^‘'Y{v). 



are then diffeomorphisms such that {•>pa)x '■ K” -> TxM, v i-> (0^“) ^(u) 
are linear isomorphisms. I 



It is not always practical to work with equivalence classes. We will there- 
fore also give an equivalent definition which is better suited for calcula- 
tions at the cost of being less intuitive. The key observation is that each 
tangent vector [7^] G TxM induces a map 
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• if ^ C^{U,K) : W is an open neighbourhood of x} K, 

This map has the following properties. 

(i) is K-linear, 

(ii) for any smooth functions f,g: M — ^ K the derivation property 



^b]Sf9) = + /(x)D[^j^,(5) 



holds, 

(iii) for any open neighbourhood U of x and any two smooth functions 
f,g which coincide in U we have 

This motivates the following 
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Definition 2.2.5. A map 

Vx'- {f € C°°(U,K) : U is an open neighbourhood of x} 

which satisfies properties (i)-(iii) above is called a derivation. The vector 
space of derivations at x with addition and multiplication being defined 
pointwise, (aVx + bwx){f) = a{vx{f)) + b{wx{f)), is denoted by Vx- 

Remark 2.2.1. The reader may wonder why in our definitions derivations 
act on {/ € C°°(W,K) ; W is an open neighbourhood of x} instead of 
the simpler set For K = R we could indeed have chosen 

but in the case K = C there exist only very few globally 
defined differentiable maps in general. In fact, in the extreme case of the 
complex torus only the constant functions are smooth. However, locally 
there is always an abundance of smooth functions. 

Lemma 2.2.1. If f\ M is constant in a neighbourhood of x, then 
Vx{f) — 0 for all derivations Vx &T>x- 

Proof Let f{x) = a. Vx(f) = avxif/a) = auj,(l) = avx{l ■ 1) == avx{l) ■ 
l + a-lvx{l) = 2avx{l) = 2vxif). ■ 



Theorem 2.2.1. Px is an n-dimensional vector space. 

Proof. We know already that T>x is a vector space and have therefore 
only to show that dim(Pj,) = n. Let (W,y>) be a chart centered at x 
and let a:*: > K the ith coordinate component of Observe that 

we have for any h\ (p{U) -4 K the identity h{y) = h{0) + hi{y)y\ 
where hi is defined by hi(y) := ^|^dt and j/® is the ith coordinate 

in K". Applying this identity to the function / o : ip{U) ^ K we 
obtain 

f{y) = f{x)+^x\y) {f o(p-^).oip-^{y). 

i^l 

Hence for any derivation eVx to f we get 

n 

Vx{f) = Vx{f{x)) + ^ (vx{x^) (/ O O (p{x) 

i=l 

+ x\x)Vxi{f OLp-'^).oip)y 

The first summand vanishes by Lemma 2.2.1 and the last term vanishes 
since x*(x) = 0. Prom Vx(f) = Ya=x (/ ° i° t{sc:)vx{x'^) we see that 
Vx is uniquely determined by the n numbers Vx{x^), . . . ,Vx{x^). I 
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Definition 2.2.6. Let M be a manifold, (W, cp) be a chart, and 



/v^\ 

[v^J 



Then x^\ U ^ K, y ^ o {p[y) is the ith coordinate function with 
respect to {U,(p) and the collection a coordinate system. 

The pointwise basis {dy.i , . . . defined by dxi(x^) = Sf is called the 

Gaufiian basis associated with (W, (p) and the vector fields d^i are called 
Gaufiian vector fields. We will often simply write di instead of d^j . 



Corollary 2.2.1. Let M be a smooth manifold and {U,p) be a chart 
centered at x G M. For any derivation Vx € T>x and any function f G 
C°°{M) we have 



Vxif) = 



i=l 



i=l 



jdfo(p 
dx'^ 



-1 



where ^ is the usual partial derivative in K^. 

Proposition 2.2.2. The tangent space TxM is canonically isomorphic 
to Vx^ The isomorphism is given by the mapi: TxM — > Vx^ K[lU)if) = 
07 (^ 0 ) cmd well defined. 

Proof. Clear by construction. I 



Hence we can dispend with the symbol Vx ^iid always use TxM instead. 
Our first definition using curves has the advantage to work also in in- 
finite dimensional settings. However, we are only concerned with finite 
dimensional manifolds and derivations are more practical to work with 
than equivalence classes of curves. 

Definition 2.2.7. Let M be a manifold, x G M and U be a neigh- 
bourhood of X. For any Vx G TxM and f G C°^{U,K) the number 
v^9 f df{vx) := Vx{f) 'Is called the derivative of / in direction Vx> 
The map df: TM Vx^ df{vx) is the differential of f. 



2.2.2 The derivative of maps between manifolds 

In the preceding section we have linearised the manifold. We can now 
linearise differentiable maps f:M^N between manifolds thereby ob- 
taining linear maps Txf: TxM 

Definition 2.2.8. Let M^N be manifolds, x G M, and 'tp: M N be 
a differentiable map. Then Tx'tp: TxM TxN, Tx'ip{v){f) = v{f o 'ip) is 
called the tangent map (or simply the derivative j of ip. We will often 
denote Txip by ip^ . 




68 



2. Analysis on manifolds 



Observe that in terms of equivalence classes of curves, the derivative of 
^ is given by [73.] t-> o 7^(^)] which is clearly the linear approximation 
of the curve 'ip oj at '^(x). 

It is instructive to calculate the tangent map with respect to a coor- 
dinate system. Let (I/, <p) and (W, be charts of M and N respectively, 
and denote by (x^ . . and the associated coordinate 

systems. In these charts 'ip has the representation ^ ~ (p o 'ip o (p~'^ and 
we calculate 



i=X 

_ ,,i ° dj^Qipo (f-y 

d& dx^ ^ dx^ 

Hence with respect to a coordinate system, the tangent map V'* is just 
the derivative of the map Again we see the tangent map T^i/; is the 
linearisation of ^ at x. 

The following is an immediate corollary of Definition 2.1.6 and Lemma 
2 . 1 . 3 . 

Corollary 2.2.2. Let M be a manifold and x £ M. A continuously 
differentiable map ip: M N is a submersion (respectively, immersion, 
local diffeomorphism) near x if and only if T^ip: T^M T^(x)^ « 
surjective (respectively, injective, bijective). 



2.3 Tensors and tensor fields 



Tensor fields play a central role in geometry and physics. 

Differential forms are not absolutely necessary for the theory of 
space and time. However^ their usage has many advantages and they 
also provide a very natural way to define integration. In particular , 
the integral theorems of Stokes and Gaufi have a very simple, com- 
mon form when stated in terms of differential forms (cf. Theorem 
2.5.5). Unfortunately, the introduction of differential forms requires 
some technical preparation. 

Some readers may therefore wish to skip the sections dealing with 
differential forms on first reading. 

The tangent bundle of a manifold is the collection of its linear approxi- 
mations {TxM}. In order to make use of the simplifications arising from 
linearisation we need to express physical and geometrical objects in terms 
of maps which are adapted to the linear structure of T^M for all x e M. 
We will see later in the book that the notion of a tensor field provides 
a good framework for this (here still rather vague) idea. In the present 
section we simply introduce tensor fields as mathematical concepts. 
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2.3.1 Algebraic preliminaries: tensors 

In linear algebra, the concept of a tensor unifies vectors, linear forms, 
bilinear forms, linear maps, determinants etc. Let V be an n-dimensional 
vector space over K. Then its dual space V* is the vector space of all 
linear maps 1/ — ^ K. It is easy to see that V* is isomorphic to V. In fact, 
if {ei, . . . , 6n} is a basis of V then the set , 0^} c F*, defined by 

6'^{ej) = Sp is a basis of V*, It is uniquely defined by {ei, . . . ,6^} and 
called the dual basis. 

While the isomorphism of V and y* defined by ei (-> Oi depends on 
the choice of basis {ei, . . . , e^}, there is a canonical isomorphism ty of 
VandF**. 



V ^ iy{v): f ^ f{v) € K 

In the following we will freely make use of this canonical identification 
V ^ V** given by Ly and write v instead of Ly(v). 

Using this identification we not only can view a vector u as a linear 
map y* K but also a linear map A: V — ^ V as a bilinear map 
A: V* X V K, (/,u) A(/,u) = f{Av). This reasoning can be 

generalised and we are let to the following unifying concept. 

Definition 2.3.1. An Q -tensor is a map 

6: yx---xyxy*x-‘-xy*^K 

'W ,, / S / 

s copies r copies 

which is linear in each of its entries. We say that ^ is an r times co- 
variant and s times contravariant tensor or a tensor of order (^) . The 
space of all -tensors is denoted byTf{V). 

The most important special cases are To(y) = K, To(y) = y, and 
Ti(V) = y*. A bilinear form such as a scalar product is an element of 
T 2 (y) and a linear map y — y is an element oiTi{V). 

For an explanation of the terminology “covariant/ contravariant” see 
Remark 2.3.1 below. First we will need to define the components of a 
tensor. This in turn requires the introduction of the (natural) “tensor 
product” 0 of tensors which generalises the usual product of numbers. 

Definition 2.3.2. Let (j> e Tf{V), € T^{V)- Then we define the ten- 
sor product 0 ^ e (y) by 
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Observe that the tensor product is not commutative. 

Lemma 2.3.1. The tensor product is associative. 

Proof. Let 0, 02, 03 be tensors of order respectively. Then 

we have 

ii>l ® ^^2) ® ^3(vi, . . . , Usi , U^i+l, . . . , Vs^+S2 , VS1+S2+U ■■■, Vsi+^2+^2 , 
cu\ . . . , w’'S . . . , 

(V^l ^ V^2)(^l J * • • J t '^Sl+l t • • ' t ^51+52 » ^ J ‘ ‘ ^ J 

X V’3(?^«i+^2+i, • • . ,'ysi+«j+s3,w'’^'‘'’’^'''\ . . . ,a;’'i+’’2+’'3) 

= tpl(vi, . . . . . . ,0J^^)lp2(Vsi + l,. . . ,Vsi+s2,U>''''^\ ■ ■ ■ 

X V’3Ki+ 52 + 1, . . . ,?;si +32+^3. • • . , 
and analogously 

i)l®{lp2 ® ^3)(vi, • • • , V,, ,Vs^+l,..., ■Usi+ 32 , t^3,+S2+l 

wS . . . 



= i’livi,.. 


.,Usi,w\.. 






{'<p2 ^ 'lp3)(Vs^+l 


'^Sl-\-S2 5 "^Si+S24-1, ■ 


. . , Vs^^S2 + S3 




(^n+v2 


^n+r2 + l^ ^ri-fr24-r3^ 


11 

J-i 




•,w''")V’2(Usi + l,-- 


• ? ’^51+52 5 











X V'3K+.2+1. • • . , Vsx+.2+*b. 



Lemma 2.3.2. /jf {ei, . . . , e„}, {0^, . . . , is a pair of dual bases then 
the set 



® • • • (8> 6»*“ <8> ej, ® • • • (S» ejjii i.ji,...j..€{i, ...,«} 

forms a basis of the space Tf{V) of all -tensors. In particular, we 
have dim(T;(y)) = n'’n*. 

Proof. The set of tensors {0^^ (g) • • • (g) 0'^''^ ® 0 • • • 0 ey,,} is linearly 

independent. In fact, let be numbers such that 

n 

i>= Y1 
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Then 0 = , . . . , ^ ^ qIv^ — , hence the tensors are all 

linearly independent. Conversely, we see that for any tensor <f) € T^(V) 
and any , ? 7 ^ ^ V* we have 

n 

jlvjr = l 

0<9^" 0 6ji 0 • • • 0 e^v('^ij • * 

The dimension of T^{V) is n'*"n® since there are exactly choices of 
ordered ^-tuples (with possible duplication) from a set of n elements. I 



Definition 2.3.3. Let ^ e T^{V) and {ei, . . . ,Cn} be a basis ofV and 
, 0^^} be the associated dual basis. The numbers defined by 

n 

V' = T, •••£’ ® ® ® • • • (g) ej,. 

are the components of with respect to the basis {ei, . . . , e^}. 

In the physical (and old mathematical) literature it is the standard to use 
for contravariant entries upper and for covariant entries lower indices. 
This provides a checking mechanism for the syntactical correctness of 
tensor formulas and also simplifies the interpretation of formulas involv- 
ing tensor components. In Remark 2.3.4 below we will introduce a very 
effective notation (Einstein’s summation convention) which is prevalent 
in the physics literature and has at its core the difference between up- 
per and lower indices. Unfortunately, many modern mathematicians use 
lower indices for all entries on grounds of ‘‘aesthetics” . 

Remark 2,3,1, The terminology “covariant/contravariant” arose in the 
19th century and refers to the transformation of tensor components un- 
der transformation of a given pair of dual bases {ei, . . . ,6^}, 

Let V = v'^’ei 6 V, w = G V*, and A = . A^i ® 9^ € 

Ti{V) be an invertible linear map. Then {ei, . . . , e„}, {(9^, . . . , 9^} de- 
fined by Cj = Acj and 9'^ = 9^ o ( A~ ^ ) are also a pair of dual bases and 
we can write v = Yh=i ^ - Z^r=r w eV we have 

n n n 

oj(w) = ^u)(w^ei) = y^w^uj{Aei) = ^ w^A{u}{ej) 
i=l i=l 

^ Y1 '^"■^\^k9^{ej) = ^ w^Alujj 
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Hence Qi = The components of oj transform covariantly, 

i.e., in the same way as the basis vectors 
Similarly, for any X eV* we have 

n n n 

vix) = = Y. 

i— 1 i=l i“l 

= Y ~Xi(A-y,v>^0^ej = Y 

Hence v'^ — ^ ^^e components of v transform contravari- 

antly with respect to the transformation A, i.e., opposite to the basis 
vectors e^. 

Another natural operation which is defined for tensors is their ‘‘contrac- 
tion” . 

Definition 2.3.4. Let (j) e TJ (V) and {ei, . . . , , 0^} be a pair 

of dual bases. The contraction of <p with respect to the fth contravariant 
slot and the sth covariant slot of 4> is defined by 






n 



rth slot 

“ ^ ^ ^{'^1 ) • • • 5 > 

i=l 



gth slot 






^U) 



8-1 



)• 



We have to show that this definition is independent of the choice of basis. 
In fact, if {ei, . . . , e^}, {0 ^, . . - , 0'^] is another pair of dual bases then 
there exists a linear isomorphism A : V -^V with = Ae^ = 
and 0^ = 9^ o A~^ = We calculate 



n 

i=l 




n 

= y] ^(vi, . . . ,e^-, . . . . . . ,(9^ . . . , 

3 



Lemma 2.3.3. Let V be a vector space over K and (j) € T^iV), ip G 
Tl{V), Then 

(cl<l>)®iP = Cli4>®i>) and <t>®{CU)=Cllt{4>®^). 
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Proof, This follows immediately from the definitions, I 

Another class of natural operations on tensors are symmetry operations. 
We introduce below the two most important symmetry operators, sym- 
metrisation sym and anti-symmetrisation alt of entries. First we need 
some technical preparation. 

Definition 2.3*5. A permutation of the numbers (1, . . . ,p) is a bijec- 
tion 



^ {(^1) • ■ ' ? ^p) • {H? • • * ) ^p} ” {^5 • ■ • j P} }• 

If (Tp is a permutation we write cr(ii, . . . , ip) = (i<r(i)j • • • ? V(p))‘ '^he set 
of all permutations of the p integers {1, . . . ,p} is denoted by Sp, 

A transposition is a permutation which permutes only two consecutive 
elements and leaves all other elements fixed. 

Lemma 2.3.4. The set of all permutations Sp forms a group (the per- 
mutation group ) and is generated by transpositions. 

Proof. That Sp forms a group is clear since the collection of all bijections 
of a given set forms a group where the composition of maps is the group 
operation. 

Let cr(ii, . . . ,ip) = (io-(i), • • • ,icr(p)) be any permutation. Starting 
with the p-tuple (ii, . . . , ip) we can use successive transpositions in order 
to move the index to the last position. Assume now that ia-(k)^ • • • ? 

are at positions k^...^p. Since ^ ? V(p)} it must be 

at one of the positions 1, . . . , A; - 1. It follows that we can move ia{k-i) 
position fc — 1 by successive transpositions which all leave the last p — k 
positions invariant. By induction we have shown that there is a finite 
sequence of transpositions which is equivalent to a. I 



Lemma 2.3.5. There is a natural homomorphism 

sign: Sp ({-1,1},-) 

of the permutation group into the group of two elements which is deter- 
mined by sign(Tp) = -1 for all transpositions Tp, 

Proof. We prove this lemma by showing that every permutation a is 
either the product of an even number of transpositions (sign(cr) = 1) or 
the product of an odd number of transpositions (sign(a) = ~1). 

First we show that the identity permutation id is not the product 
of an odd number of transpositions. Assume that id is the product of 
finitely many transpositions and denote by nik the number of all those 
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transpositions which interchange the numbers I and fc. The number nik 
must be even since at the end I must be on the same side of k as at 
the beginning and since there are no other transpositions which inter- 
change k and L If we set nu — 0 then the number of all transpositions is 
which is the sum of even numbers and therefore even. 
Let now cr = n o • • • o = fi o * . . o f; where fj are transpositions. 
Since id = (ti o • • • ofi o • • • of^ = {rk)~^ o • • -o (n)“^ ofi o ■ • *077 

is the product of fc -h Z transpositions the number k -h I must be 'even. 
Hence k and I are both even or both odd. I 

A permutation ap acts in a natural way on a tensor 'll; e T^{V), 

Definition 2.3.6* For any ^ 6 T^{V) and any permutation ap € Sp 
we set CTpipivu ^..,Vp):= 

Lemma 2.3.6. For any permutations Tp^ap € Sp and any tensor € 
Tp{V) we have {(jp o Tp)'ip = Tp(<7p^). 

Proof. Let ui, . . , , Up € V. We calculate 

(cTp OTp)'0(ui, ...,Vp)= V^KpOr,.(l), - - ■ ,'^^TpOTp(p)) 

“ '^('^^rp(rp(l))? • • • > '^CTpo(rp(p))) 

— } • • • ? '^Tp(p)) 

= ' j ^Tp(p)) 

which implies the first equality. I 



Lemma 2.3.7. The maps 

sym: T^{V) -4 T“(V), alt : T^{V) ^ T^{V), 

^■P'^ sign(cTp)apV' 

(Tp^Sp CFp^Sp 

are linear projections. 

Proof. We only prove the lemma for the operator alt since the proof 
for sym is completely analogous. That alt is linear is clear from the 
definitions. For given ip e (F), vectors , fp and any permutation 
Tp we have 

altip{vi,...,Vp) = ^ sign(<7p)V’K^,(i),...,?^<r„(p)) 

(Tp^Sp 

== i Z) sign((Tp)sign(rp) • • • ,'u<T„or„(p)) 

o-p€Sp 
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= sign(Tp) alt V’Kp(i), • • • , Vr„(p)), 

where we have used that Rr: Sp Sp^ dp (jpTp is a bijection. It 
follows that 

alto alt V’((vi, ...,Vp) = ^ ^ sign(Tp) alt^(t;^^(i), . . . , v^^(p)) 

Tj,€Sp 

TpeSp 

= alt'ipivi^.^.^Vp) 



and therefore alt o alt = alt. 



Definition 2.3.7* A covariant tensor 'll; 6 T^{V) is called symmetric 
(respectively, anti-symmetric^ if for all s -tuples of vectors (vi,,..,Vs) 
and all permutations dg of [1, ... ,s) the equality 



i>{Vu . . • ,-Us) = ■ ■ ■,V<r.(s)) 

{respectively, i){vi, ...,v^)= sign((Ts)V'(v,(i)> • • • > V,(s))) 



holds. Symmetric and antisymmetric contravariant tensors are defined 
analogously. 

The set of all r times contravariant and s times covariant tensors on 
TxM, where x £ M, form a vector bundle which generalises the tangent 
bundle. 



[p. 65 i] 
■I p. 84 



Proposition 2.3.1. Let M be an n- dimensional, smooth manifold. The 
set TJM — UaieM of all {f^-tensors carries a natural vector 
bundle structure. 

Proof. Let (Z^q,, ipa) be an atlas of M. Then with each chart Ua we can 
associate a map 

: U T;(T^M) <I>,,{U) X 

x£U(x 

where components of the tensor <t>x with respect to the 

Gaufiian basis .,5a:”- It is clear that each tensor ipy € TJ(TpM) 

lies in at least one of the sets T^Ua '■= Ua:€W„ T^iT^M) (a e A). To see 
that the collection {T^Ua,ipa) forms an atlas of TJM we have to show 
that for each pair of indices (a, /3) the map 

o : MTIUc n T:U0) V-cCTIWc n T^Up) 
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is a diffeomorphism. We denote the coordinate system associated with 
(pp) by (^\ . . . y^) and let x For any vector v € TxM we 

can then write v = Yh=i = £^=1 where 









In other words, the column vectors (vp) consisting of the compo- 
nents of the vector v with respect to our charts are related by (va) = 
^<fap{^p)i where we have set (p^p — Pa^ Let a? be a 1-form. The 

row vectors (a;^), (tu^) consisting of the components of to with respect to 
the charts must then be related by (cj^) = {up)D{pap)~^) since these 
components are defined by w(u) = (o;^) • (va) = (uj^) * (vp) for all vectors 
V, By the same argument it follows that the components j and 

<t>\p)jl„j^ are related by 

Hence the components {4>a) and {4>p) of 4>x with respect to the charts 
i^py^p) are related by a linear isomorphism Dap- Thus the 
map xpa '^p{y,{^ 0 )) = {<^a 0 {y),{Dap{^ 0 ))) is indeed a diffeomor- 
phism. We have shown that TM is a manifold. That TM is also a vector 
bundle follows since for any x ^ M and any chart (UajPa) the map 
TJ(TcM) — ^ (j)x ^ i<l^a) is a linear isomorphism. I 



By construction of TJM we have TM = TqM. 

Definition 2.3.8. Let M be a manifold and r, s € N U {0}. The vector 
bundle TfM is called the tensor bundle of r times contravariant and s 
times covariant tensors. 

The tensor bundle TjM is also denoted by T*M and called the cotan- 
gent bundle of M. 

The bundle of s-forms is the subbundle A^MofT^M defined by 

oj € A^M ^u){vi,...,vu...,vj,...vp) = ~u(vi,...,Vj,...,Vi,...Vp) 
for any set of vectors {vi ^ . . . , Vp} and any pair 

(i,j) C {l,...,p}. 

The bundle of s-forms will play a fundamental role in Sect. 2.5 on dif- 
ferential forms (which may be omitted on first reading). We will give an 
equivalent definition in Lemma 2.5.8. See also Definition 2.3.9 for the 
vector space analogon A^{V). 
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In the rest of this algebraic section we will introduce the space of 
p-formS; a special class of tensors which generalises the notion of de- 
terminant and plays an important role in the analysis on manifolds. 
For instance, it is fundamental to the integration over a manifolds ( cf. 
Sect 2 . 5.4 below) and in the theory of differential systems (Bryant, 
Chern, G.ardner, Goldschmidt, and Griffiths 1991). However, the use- 
fulness of this concept will only become clear in applications and not 
on this technical algebraic level. 

The material presented here will not be used before Sect. 2.5. The 
reader may therefore wish to postpone the study of the rest of this 
section. 



It turns out that the symmetry operator sym is not of particular impor- 
tance. On the other hand, tensors if which satisfy alto-^ = generalise 
the determinant and therefore deserve a special definition. 



Definition 2.3.9* A form (of degree p) (or simply p-formj is a tensor- 
€ Tp(V) with dltxf ~ 'if . The vector space of all p-forms is denoted by 

The tensor product induces a product A of forms: 



Definition 2.3.10. The exterior product (or wedge product^ A is the 
bilinear map 



A: A^{V) X A^V) A^-^^iV), 



^ to Ap := 



(p + g)! 

p\q\ 



alt(w ® r;). 



Remark 2.S.2. The normalisation factor is not the only possible 

choice and may appear to be rather unnatural. Other factors are also 
common, in particular the alternative definition coAp = alt (o' 0 p). We 
have chosen our factor in order to minimise similar combinatorial factors 
in formulas to come (see Remark 2.3.3). 

Our choice agrees with the choice of Bryant, Chern, Gardner, Gold- 
schmidt, and Griffiths (1991) and (Abraham and Marsden 1978), but is 
different from the normalisation factor in (Abraham and Marsden 1967) 
and the normalisation factor in (Kobayashi and Nomizu 1963). 



Lemma 2.3.8. The exterior product is associative and antisymmetric. 
More concretely, let cv 6 A^(y), p € A^iy), and X e A'^(V). Then the 
formulas 

(i) (tv Ap) A X ~ iv A {p A X) and 

(ii) u) Ap — (— 1)^^^ p Auj. 



hold. 
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Proof, (i): We calculate 



(cj A r/) A A 



{{p + q) + r)\ 



{p 4- ^)!r! 

((p + q) + r)l (p + q) 
{p + q)\r\ 
ip + q + r)\ 



alt((w At])® \) 



p\q\ 



alt((w ®T]) ® X) 



p\q\r\ 



■ alt(w ®rj®X). 



Analogously one shows w A (77 A A) = alt(o> ®p®\) which proves 

associativity. 

(ii): Let Tp+q 6 Sp+q be the permutation given by Tp^q{l , . . . ,p + 

9 ) = (g + 1, . . . , g + p, 1, . . . , g). Then we have sign(rp+q) = (- 1 )p« and 

rp+q{0J ®p){vi,..., Vp+q) =W® r){Vq+u • • • , Vq+p, Vl, . . . ,Vq) 

= rj®w{vi,...,Vp+q). 



Using these formulas we obtain 

(v oV 

f slt(uj (g) rj) 
plql 

= ^ sign{ap+q)ap+q{(j®p) 

H Sign{(7p+q)sign{Tp+q)ap+qTp+q{uJ®p) 
S sign(ap+g)cTp+q(77(8)a;) 

= {—ly^p A u). 



Lemma 2,3.9. Let e V* and TTp e Sp, 

(i) 00^ A - ' Auj'^ = sign(7Tp)a;^?Ai) /\ . . . Alo'^p^'^\ 

(ii) o;^ A • • • A o;^ = sign(<7p)o;‘^?'^^) ® 

(Hi) A" 'Au)'^ = 0 i/ and only if the 1 -forms o;^, . . , , are linearly 
dependent 

Proof (i): Since transpositions generate all permutations, it is sufficient 

to prove this equality for a transposition 

7Tp(l, . . . , J, . . . , 77-) (1, . , . , 7, . . . , n). 

But in this case the assertion follows immediately from Lemma 2.3.8, 
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(ii) : The assertion is clearly true for p = 2 . Assume that it is true 

for collections of up to q 1 -forms , . . , , and denote for any aq € Sq 
the permutation (n, . . . , i-^ {(7q{ii), . . . ,aq{iq),iq^i) by ^,^ 4 . 1 ( 0 -^). 
Then we have 

uj^A ■ - Auj^ A 

^ 12 sign(Tg+i)(w^ A---Aoj‘<) 

Tq+lESg+l 

0 o;^“^ ('^rc,+ i(l)) • • • 5 "^Ty 4 .l(q+ 1 )) 

= ^ 12 X] sign(rg+i)sign(<Tg)w^( 8 )... 

Tq+lE:Sq+l CTij^Sq 

0 o;^ 0 ('t’c7yOry+l(l)) • • ■ ) '^(7q0rq+l(q) ^ '^Tq+l(q+l)) 

^^,12 12 Sign{Lg+l{(Tg)OTg+l)u>^(S)... 

CTq^Sq Tq+l&Sq+l 

0 0 ;^ 0 0 ;^"^ (t^ty+a(o-y)or,;+i(l)) • • • i'^Lq^i(aq)orq+i{qi-l)) 

= X3 sign(r,+i)a;^ ® . . . 

r,/+i65y+i 

® ® . . ■ ,Vr,+i(9),?^T,+i(9+l))- 

(iii) ; If o;^, . . . are linearly dependent we can assume that is 

a linear combinations of . , . , (otherwise we could renumber the 
u;^). There exist numbers (i = 2, . . . ,p) with o;^ = Hence 

the right hand side of 

p 

u;^ A“ • Au^ — ^ A A • • • A 

i=2 

is the sum of products each of them containing some factor uj^ twice. 
Hence all summands vanish by (i). 

If the forms o;^, . . . , o;^ are linearly independent then we can complete 
them to a basis {a;^ . . . , ou'^} of F*. Denote the dual basis by {ei, . . . , 6 ^}. 
If A - • A would vanish then so would A • ■ • A However, we 
have 

A--- Ao;”{ei,...,e„) = sign(c 7 -„)a;^(e<j„(i)) • • • 

Cf}, e S71, 

= XT sign((T„)5^^(i) = 1 

<T„6S„ 
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Lemma 2.3.10. //{ei, . . . , {0^, . . . , 9'^} is a pair of dual bases then 

the set 

A--- 

forms a basis of the space A^fV) In particular, dim(^P(V')) = (”) . 

Proof First we show that this set of tensors is linearly independent. Let 
be numbers such that 

A'-- =0 

il<i' "Kip 

and let fci < • • • < kp. Then 0 = r]{eki ^ . . . , e^p) = Vki...kpi whence the 
p-forms {0^^ A • ' • A ^^^'}ii<...<ip are all linearly independent. 

Conversely, we know from Lemma 2.3.2 and alt(Tp(y)) — A^{V) 
that the set of vectors A • • • A spans A^(V). Since 

Q^vin) /V ... A = sign((7p)0^^ A • • • A 

for all permutations ap € Sp we can restrict to^ those A • • • A with 
i]^ Ki ' ‘ • <c. ip* 

The dimension of A^{V) is since from a set of n elements there 
are exactly (^) choices of ordered t-tuples without duplication. I 

Definition 2.3.11. The map 

J:VxA^{V)^A^~\V) 

(v^uj) ^ vJlo: {wi ^ . . . ^Wp-i) ^ oo{v^wi ^ . . . yWp-i) 

is called the interior product. If p = 0 we set v Juj = 0. 

Lemma 2.3.11. Let to e A^{V) and G A"^{V). The interior product 
satisfies vJ{u) A rj) = (t? Jo;) At} + (—1)^ to A (vJrj) for all p-forms a; and 
all q-forms rj. 

Proof, For notational purposes we set v — wo* For any vectors . . . , 
Wp+q-i G y we have 

{WoJuj) A rj{wiy . . . , Wp^q^i) 

= fa Y sign{ap+g-i)uj{wo , . . . 

^ a„+,_i€5p+,_i 

• • ■ X »"^,,+,,_i(p+<,-l)) 

and 

{-lYiOA{woJ'n){wi,...,Wp+q-i) 
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pUq — 1)! ■ • • )‘i^Ty,+y_i(p)) 

'^7^ + «/ - 1 ^ + </ - 1 

We consider now all possible permutations of the ordered set {wq^wu 
. . . , Wpj^q-i), We can divide them into two subgroups, the group 5^^^ 

where wo is in one of the first p positions and the subgroup where 
it is in one of the last q positions. Using this notation we can write 



U)hp{W0,Wu , , , 

X • • • 5'^o-j,+y(p+g-l)) 

+ J2 sign(fp+,)w(tz;f^^,,(0),...,tyf,,+,,(p-i)) 

^ ^(^fp+y(p)5'^fp+y(p+l)) • • • 

< 

Consider the first summand. For each &p^q € Sp^^ we can shift wq to 
the first entry of to by executing (dp+g)“^(0) transpositions. Hence there 
is a unique permutation € Sp+g_i with 

• • • ^'^&p+,f(p-l))v{'^arp+,j(p)i • • • )'^ap+,j{p-\rq~l)) 

= . . . .■U'<Tp+„(P-l)) 

^ ^(^cr,,+,,(p)i • • • I ^o-p+,,(p+g-l))- 

Observe that dp+^ is the composition of ((7p+g)“^(0) transpositions and 
the permutation 

{ (7p^q{i) for i > 0 

0 for z = 0. 

This implies sign(dp+^) = (-l)(c^7>+7) ^(o)sign(crp^^). Since Wq can be in 
each one of the p possible entries of uj we obtain 



p\q\ sign(dp+q)ct;(ti;^^^_j_^^(0)T'^a-j,+y(i^ • • • i'^<T^,4-q(p-i)) 

X ^(w^CT,,+,,(p)> • • • >'W&p^,j(p+q_l)) 

12 sign(<Tp+,_i)a;(t«o,^yap+,_i(i),---,tt'^p+,,_i(p-i)) 

<Tp + t}-l€Sp4rij~l 

^ '^('^^7>+y-i(p)’ • * • » ^'7p+y-i(p+g-l)) 
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= {woJU}) /\v(Wl,---,Wp+q-l). 

Consider now the second summand. An analogous argument for fp+g e 

allows us to move wq to the first entry of 77 . In this case we obtain 
an additional factor (- 1 )^’ since fp+g is the composition of (rp+g)“^( 0 ) 
transpositions, a permutation 



i 



Tp+g(i) for i > 0 

0 for i = 0 , 



and the p transpositions which move Wq from the first entry of oj to the 
first entry of rj. This implies sign(fp+g) = 
and we obtain 



p\q\ 

Ty, 4. q _ 1 € *Sp 4, CJ, _ 1 

X '^(u;o, . . . 5'^Tp4y-i(p+(?-i)) 

= (“If a; A (u;o . . . , u;p+g-i). 



Finally, we relate the theory of p- forms to the determinant of linear maps. 

To motivate the definition below recall the following from linear al- 
gebra. Assume that we have an Euclidean scalar product (•, •) and an 
orthonormal basis {ci, . . . , Cn}. If {0^ . . . , 0^} is the dual basis then one 
can use the n-form A • - A in order to measure the volume of parallel 
epipeds. For any vectors &i, . . . ,6n ^ V one defines the volume of the 
parallel epiped spanned by these vectors to be A • ■ • A ^’^(61, . . . , 6^)- 
This number depends on the chosen scalar product but not on the or- 
thonormal basis. The determinant of a linear map B: V V is often 
defined as det(B) A • • • A , 6n) where bi = Ba. 

This definition of a determinant obscures the fact that the deter- 
minant is independent of the choice of scalar product. The following 
equivalent definition is probably the most natural way to introduce the 
concept of a determinant. 

Definition 2.3. 12« Let V be an n- dimensional vector space j W be a 
k-dimensional vector space over K, and A: V W a linear map, 

(i) The pull-back of 'll) under A is the map A*: T^{W) ^ T^iV) 
defined by . . . ,Up) = f (Aui, . . . ,Up). 




2,3 Tensors and tensor fields 



83 



(a) Lei fj, € A^{V) \ {0} and assume that V ~ W, Then the deter- 
minant det(yl) of A is the number defined by A* ij, = det{A)fa, 

We have to show that the map det is well defined. First observe that A* 
maps A'P{V) into A^{V) and recall that is 1-dimensional by Lemma 
2.3.10. Hence must be a multiple of /x. If ft is any other non- vanishing 
n-form then there exists a number a 7 ^ 0 with fi = a/j,. Hence we have 

A*fi{vi, = A*{afj.){vi, ...,vn)=^ afj.{Avi,. . , , Avn) 

- a det{A)fa{vi , . . . , - det{A)jl{vi , . . . , 

which implies that our definition for det(A) does not depend on the 
chosen n-form. 

Lemma 2.3*12* Let V be an n-dimensional vector space overK, 

A:V-^V 



be a linear map and {ei, . . , , Cn}, , 0'^} be a pair of dual bases. 

Let the components A'j be defined by Aci — Then we have 









In particular we have 



det(yl)= Y sign(<Jn)^i„(D 

iT„es„ 



Proof. We calculate 
A... 

= Y {Ae^^ ) ) {Ae„^, y,,) ) 

cp e Sj) 



Remark 2.S.3, We have noted in Remark 2.3.2 above that our combina- 
torial factors are not the only possible choice. If we had chosen A (cf. Re- 
mark 2.3.2) instead of A, we would have had to re-define vJcv = pv Ju 
in order to preserve Lemma 2.3.11. Another advantage of our choice is 
that the simple formula 

A • ■ • A 6^^(ei, . . . ,en) = 1 

holds for any pair of dual bases • 
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2.3.2 Tensor fields 

In this section we introduce tensor fields which are tensor valued maps. 
All physical and geometrical structures we will encounter can be defined 
in terms of tensor fields or maps of tensor fields. 

Definition 2.3.13. The vector space of all smooth maps 

TIM with 4>{x) € Tl{Ta,M) Vx € M 

x€M 

is denoted by Tf{M), These maps are called (^) -tensor fields. Often we 
will write 4>x instead of (j>{x) , 

A -tensor field is a vector field. 

A (p ) -tensor field to with 

u}{vi, . . . , Ui, . . . , Uj, . . . Up) = . . . , , Ui, . . . Up) 

for any set of vectors {ui,...,Up} and any pair {i,j) C is 

called a differential form of degree p. A differential form of degree p is 
often simply called a p-form. 

Let E he another manifold and f : E ^ M be a smooth map. An 
Q -tensor field along f is a map E Tf{AI) such that 6 

27 (T/(^)AT) for all xeE, 

Vector fields and differential forms along f are defined analogously. 
If E is a submanifold of M and f : E M its canonical inclusion 
then we also say that ip is a tensor field along E. 

Let (o:^, . . . , x'^) be a coordinate system and denote by di, . . . , dn its 
Gaufiian basis. Its pointwise dual basis is given by the derivatives (da:^, 

. . . , dx’^) (cf. Definition 2.2.7), With respect to these coordinates, any 
(^) -tensor field (p can be written as 

'^(^) = E ^7i ' X 9*1 ® ® ® < 2 ) ■ • • ® , 

where the <P^ff functions. They are the components (or component 

functions) of the tensor <p with respect to the coordinates (a:^ , . . . , x^). If 
()6 is a covariant symmetric (^) tensor field we will often drop the tensor 
product sign 0 in order to indicate that the symmetrisation operator 
sym has been applied, 

sym(^) (x) = ^ 1 < ji . . . is < (x) dx-’^ . . . dx-’* . 

If is a second manifold and / : M N a> smooth map then each 
tensor field <p € T^{N) induces a unique tensor field f*(p € its 

pull-back This tensor field is defined by 
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where Tf: TM TN denotes the derivative of / (Definition 2.2.8). 
Analogously, each tensor field $ G Tq (M) induces a unique tensor field 
e T^{N)^ its push-forward. It is defined by :== 

o 0 Tf). li 'll) £ 'T'f{N) and g: M N is a. local 

diffeomorphism we define the pull-back of 'ip by 

■- -0 {{9*ivi), ■ • . ,9*{vs), (9~'^y (w^), . . . , {g-'^y . 

This definition agrees with the definition of pull-back above if -0 is a 
covariant tensor field. The push forward of a tensor field -0 € (M) by 

a local diffeomorphism g is defined by g^'ip = (g~^)* ip. If $ — 0 then 
both definitions of push-forward agree. 

Lemma 2,3.13* For every local dijfeomorphism g: M N and all 
tensor fields (p,ip eTf (M) we have 

9 *{a<j> + /3ip) = ag*<j> + 

9*{(p®tp) = 9*<j>®g*-ijj, 

9*CU = cr/v- 

If g fails to be a diffeomorphism, and <p, ip are covariant vector fields then 
the first two assertions are still true. Analogous statements also hold for 
the push-forward. 

Proof. The lemma follows directly from the definitions, I 



Remark 2.3.4- K is often practical to denote a tensor field <p (M) by 
its components with respect to an unspecified Gaufiian basis. The 

indices ai, . . . a^., 6i . . . 6^ are ‘‘dummy indices” and their only purpose 
is to give a convenient method for explicitly denoting the entries of the 
tensor field. By convention, the tensor is characterised by the core symbol 
(p, so in general we have ~ ^ However, different 

tensor fields of different order can be denoted by the same core symbol 
since it is clear from the number of co- and contravariant indices that 
they must be different objects. 

If a,p £ K, (p, ^ are two (^)-tensor field, and ip is an (0-tensor field 
we write 

® In some East European countries, some authors use conventions which are 
the exact opposite of ours, i.e., they have in general (pll"ff ^ Pd\ "d. ~ 
I^ seems likely that this variant will fade in the near future. 
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(' 



^ ai ...a. 



= «C:::6:+/3<\::6:. 



+ p4>j 

Since the indices determine the entries explicitly, we can write 



4> 



ai ...ttr 



j.ai...ar 



C::.6T 






ai 

61. ..6s • 



The notation for contraction is often called the Einstein summation con- 
vention: If in a product of tensors one index letter appears as an upper 
index and somewhere else as a lower index then it is understood that one 
has to sum over these indices in the corresponding coordinate expression. 
We will also use this notation in order to write a tensor with respect to 
a basis, for instance, 



n 

Alda ® drr^ = Al&a 0 

a, 6=1 

Symmetrisation and anti-symmetrisation are denoted by 
sym(V')ii,„.,i. and 

An analogous notation is used for contravariant tensors and for the case 
that symmetrisation or anti-symmetrisation is only applied to a subset of 
entries. We use the delimiters | to indicate that a certain subset of entries 
is not symmetrised (respectively, anti-symmetrised). For instance, 

jab[cd]ief)g\h] 

n{j\k\l\{mn)\o) 

indicates that the entries corresponding to the indices {c, d, h} are anti- 
symmetrised and that the entries corresponding to the indices {e,/}, 
{jyho}, {m, n} are each symmetrised. 

This abstract index notation should not be confused with the com- 
ponents of the tensor field with respect to a given basis. While all for- 
mulas appear to be identical, in the former case, the tensor is referred 
to, whereas in the latter case one simply has a collection of K- valued 
functions. 

The main advantages of the abstract index notation over the usual 
notation without indices are that tensor operations are easy to remem- 
ber since tensors look like their components and that even complicated 
multiple contractions and tensor products can be understood at a glance. 
The main disadvantage is that many formulas look unnecessarily clumsy 
because of the jungle of indices involved. In the physical literature, the 
abstract index notation is usually preferred and in the mathematical lit- 
erature the notation without indices is prevalent. In this book, we will 
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use both notations. However, unlike in most of the physics literature here 
we will not reserve certain indices for special ranges. Moreover, we will 
sometimes write the summation sign in order to indicate the range. 



2 A Vector fields and ordinary differential equations 

If 7 : K ^ M is a curve then for each t G K the vector 7(f) Tt'y{l) is a 
vector in Ty(^t)M. The following question arises naturally: given a vector 
field V on M, does there exist a curve 7 with 7(f) = V^7(t)? With respect 
to a chart, the answer is given by the fundamental theorem for ordinary 
differential equations which establishes the existence and uniqueness of 
solutions. Because of its importance we will state it below. However 
this theorem should really be treated in standard courses on analysis 
(for mathematicians) or mathematics for physicists. See also (Dieudonne 
1960, chapter 10) for a proof in a slightly more general context. 

Theorem 2.4.1 (Fundamental theorem for ODEs.). LetU c 
V C J' CK be open subsets, and 

/;JxWxV->K”, {t,x,y)y-^ f{t,x,y) 

be C'^ . For each (to^XQ^yo) 6 J 'xU x V there exists a neighbourhood 
JxVdJxV of {to,yo) and a unique map 7: (t,y) h-j- j{t,x) 6 
with 

= fit, lit, y),y) 

for all (t,y) € X V and j(to,yo) ~ xq. Farther, 7 is . 

This theorem translates straightforwardly to manifolds. 

Theorem 2.4.2. Let M he a smooth manifold and V be a vector field 
on M. For every x E M there exists a subset C K and a curve 
M with 7x(0) = X and 'y{t) = for all t E J . 

If Vx ^ 0 there exists a neighbourhood U of {x} x {0} C M xM 
such that the map F:U ^ M, {t,y) ^ Ft{y) — 'Jyit) is well defined. 
Moreover, the map y ^ F^{y) is a local diffeomorphism for each t, and 
its inverse is given by 

For t, s small enough we have Fto Fs ~ Ft^s- 

The curve 7 is called an integral curve and the map Ft the flow oiV. 

Proof Let {U, ip) be a chart centered at x. Then the fundamental theorem 
for ordinary differential equations (Theorem 2.4.1) implies that there 
exists a solution /3{t) of the differential equation 

^P{t) = 
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with 13(0) = (f(x). Hence jx ~ ^ o /? is an integral curve of V with 

7:^(0) = X. 

That F is smooth and well defined follows from the fact that integral 
curves depend smoothly on parameters (cf. Theorem 2.4.1). The map 
(F^)~^ is given by (F^)~'^{z) ~ 7_(t) where 7_ is the locally unique 
integral curve of ~~V with 7-(0) = 2;. Hence F^ is differentiable and has 
a differentiable inverse. 

The equation Ff^^ ~ Ft oF^ follows from local uniqueness of solutions 
of differential equations and the fact that both t ^ FtJ^s(y) and FtoFs(y) 
are integral curves of V with the same initial value. I 

An integral curve j: J — > M of a vector field V is called maximal if 
the existence of an integral curve J M with J C J CK implies 
J ~ J. By the lemma of Zorn and local existence of integral curves 
each integral curve is contained in some maximal integral curve. The 
following Proposition shows that in the case K - R maximal integral 
curves are unique. 

Proposition 2.4.1. K = R then there is a unique maximal subset 
J cK and a unique solution 7 of'y(t) V^(^t) defined on J. 

Proof. Let 7,7 be integral curves of V with 7(0) — 7(0) = x. We must 
show that these integral curves coincide on the intersection J C\ J oi 
their domains J , J . In order to do so we will prove that the set K — 
{t € n v7 : 7(t) = 7(0} is both open and closed. Since in the case 
IK = R the set \s the intersection of two open intervals and therefore 
connected, this set must then coincide with K, It is clear that IC is closed. 
Let t €K. and (U^ ip) be a chart centered at x = j(t) — j(t). Given local 
coordinates, the problem of finding an integral curve of V reduces to 
solving a system of ordinary differential equations. An application of the 
fundamental theorem for ordinary differential equation (Theorem 2.4.1) 
proves that there is a unique local integral curve /? through (p{x). Hence 
there is a neighbourhood of t such that 7 = 7 on this neighbourhood. I 

The following theorem implies that, locally, all non-vanishing vector 
fields are alike. 

Theorem 2.4.3. Let M he a smooth manifold, x ^ M, and V he a 
vector field with V(x) 7^ 0. Then there is a chart (U,ip) centered at 
X such that <p^ = d^i and the integral curves of V are given by t 

Proof, Let N be an (n ™ l)-dimensional submanifold of M through x 
which is transverse to V (i.e., RH(y) 0 TyN = TyM for all y € N), Let 
(V,'^) be a chart of N centered at x and denote the flow of V by Ft. 
There is an e > 0 such that the map 
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/: (-e,e)xV’(V)^M, (2.4.1) 

. .,x") . . ,x”)) (2.4.2) 



is well defined. The differential of / at 0 is an isomorphism since 



Fq = id and 







= V{x) ^ 0. 



Hence there exists a neighbourhood W of 0 where this map is an dif- 
feomorphism. The pair (W, 9?) = {f{yV)^if~^)\u) is therefore a chart 
centered at x. For any y eU denote by (yoiW^iy)) ^KxA/'cKxM 
the unique pair defined by Fy^{pi^{y)) = y. Then we have (p o Ft{y) — 
f-^oFt(y) = f~^oFt^y^{pYj^{y)) = {t + yo,^p{pY^{y))). Hence the inte- 
gral curves of V are indeed the curves 1 It follows 
immediately that tp^V — d^i. I 

Given a vector field V, one can define the derivative of a tensor fiels 
in direction V. 



[p. 84 1 ] 
i P- 121 



Definition 2.4.1. Lei x G M, '0 be a tensor fields U be a vector field, 
and Ft the flow of U. Then 



is the Lie derivative of 

(^)|t=o usual derivative in vector spaces. In fact, the expres- 
sion Ft*?/? denotes a tensor field which is defined on a neighbourhood of 
rr if t is fixed and small enough. In particular, this tensor field can be 
evaluated at x. As a function of t this gives a curve in the vector space 

The Lie derivative measures the change of ?/? along V. 

Lemma 2.4.1. Let V be a vector field. Then the Lie derivative in di- 
rection V is a derivation, i.e., for any tensor fields ip,'ij) the formulas 



£y{p {g) ^) = £y(p g + 9? 0 
£y{(p + ?/j) = £yp -I- £y1p. 



hold. 

Proof. These formulas follow immediately from the properties of deriva- 
tives. I 
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Theorem 2.4.4. Let [/, V be vector fields and f a smooth function. 
Then 



£uf — U • f and 
£uV ^f = U.V.f-V.U.f, 

Proof, The first equation follows immediately from 

Let X 6 M and (W, (p) be a chart centered at x. There is a neighbourhood 
V C W of X and a number e > 0 such that Ft{y) is well defined and 
satisfies Ft{y) G U for all for all (f,y) € (-e,e) x V. An application of 
the Taylor formula to the map 1 1 -> / o F-t o ip~'^ implies the existence 
of a smooth map g\ (-e, e) x ^(V) -> R with / o F-t © = f{z) + 

tg{t,z) for all (t,z) G (-e,e) x V. The map gt(y) := g(t,(p(y)) satisfies 
/ o (Ft)~^(y) = f{y) + tgt(y) for all j/ G V and we obtain 

= i.., ‘ f) = 4... (''«« • (/of-.)) 

where in the last step we have used 

■= • (1,.-.^ ° (s,.=oW)-h) 



Theorem 2.4.4 shows that the Lie derivative of V in direction U is the 
commutator of U and V, This motivates the following definition. 

Definition 2.4.2. IfU^V are vector fields then we call [U^V] = £jjV 
the Lie bracket or the commutator of U and V. Vector fields commute 
if their Lie bracket vanishes. 
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Commuting vector fields are of particular interest since Gaufiian vector 
fields d^k , dy .1 are necessarily commuting. The following lemma gives the 
converse to this observation. 

Lemma 2.4.2 (Geometric interpretation of the Lie bracket). 

Two vector fields [/, V have vanishing Lie derivative, £jjV ~ 0, if and 
only if their flows commute. 

Proof Denote the flows of U and V by Ft and Gg. The equation FtoGg = 
Gs o Ft implies that F-toGgO Ft ~ Gg is the flow of V. Hence we have 

V = o G, o Jf,) = T(F,)-‘ (£G,) oF, = F,‘V and therefore 

Conversely, assume that £i/V = 0 which is equivalent to Ft* V = 0. 
Since F^*V bh integration yields {FtYV = V for all t. This implies 
that F-^t^GgO Ft is an integral curve of V. From the uniqueness of 
integral curves we get F-toGgoFt — Gg for alH, s. I 



Corollary 2.4.1. Let M be a n-dimensional manifold and {?7i, . . . , 11^} 
be a collection pointwise linearly independent, pairwise commuting vector 
fields defined on an open neighbourhood of x e M. Then there exists a 
coordinate chart (V, ip) centred at x whose Gaufiian basis vector fields 
satisfy d^i = Ui. 

Proof Denote the flow of by F^ and let 

. . . ,x") = o . . ■ o F^„(x) 



for sufficiently small {x^,...,x'^) e Since the vector fields Ui are 
pairwise commuting so are their flows F^i (cf. Lemma 2.4.2). Hence we 
have for every i e {1, . . . , n} 






Fli o FL o • • • o F 

X^' X^ X 



d-1 



I 






This implies ^ . . . , — Ui for the standard 

basis {F^i , . .. ,En} of Since the vector fields {Ui,. . , ,Un} are linear 
independent the map ^ has maximal rank and is therefore a local diffeo- 
morphism. Let W C be an open neighbourhood of 0 such that ‘ip{z) 
is well defined for all ^ 6 W and one-to-one on W. We can now define 



Corollary 2.4,2. Let M be a 2-dimensional manifold and U , V be vec- 
tor fields which are at each point linearly independent. Then M admits 
local coordinates (xi,X 2 ) such that || U and d ^2 || V. 
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Proof, By Corollary 2.4.1 we only have to show that there exist functions 
/, h with [/?7, hV] = 0. We calculate 

0 = mhV]^Vfjj(hV)~V^,yfU 

= fhVuV + fdh{U) - fhVyU - hdf{V)U 

= fh[u, y] - hdf{v)u + fdh(u)v 

- fh {[U, y] + din(/i)(j7)y - din(/)(t/)y) . 

Let be the 1-forms which are dual to J7, V, Then any solution 

(/, h) of the uncoupled system of linear ordinary differential equations 

0 = dln(/i)(C7) - y]), 0 = dln(/)(y) + oj^{[U, y]) 

satisfies [/[/, hV] = 0. This system of differential equations can be solved 
by the fundamental theorem for ordinary differential equations 2.4.1. I 

In the following sections we will encounter various kinds of tensor deriva- 
tives. It is therefore practical to formalise their common properties. 

Definition 2.4.3. Let D be a map which maps tensor fields into tensor 
fields. If it satisfies 

(i) Z)(T/(M))cT/(M), 

(a) D{<p 0 ” D('0) (g) 9 ? + ^ ® D{(p) (^^product rule”), 

(in) D commutes with contractions, 

then D is called a derivation. 

Corollary 2.4.3. Let M be a manifold and V be a vector field on M. 
The Lie- derivative £y is a derivation. 

Proof, We have to verify only the third property. This follows from 
Ft^Ct(j> = CtFt^cj) and the fact that ^ is linear (so the 

derivative ^ can be interchanged with this operation). I 

Proposition 2.4.2. Two derivations coincide if they coincide on vector 
fields and functions. 

Proof Writing an arbitrary tensor field ^ in a coordinate representation 
we obtain 

~ C) • • • ® 9^,. 0 dx*^^ 0 • • • 0 dx-^^* 

r 

+ ® • • • (S> D{diJ (g) • • • (g) (g) dec-’" (g) • • - (g) da;-’- 

t = l 

+ ^ 0 • • • 0 di^. 0 dx-^^ 0 • • • 0 D(dx-^^ ) 0 • • • 0 dx-^^*^ . 

t=i 
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Hence we only have to show that D is uniquely determined for tensor 
fields uj e Ti{M). But this follows from D{u{V)) = D{Cl{u) 0 V*)) = 
= Cl{Dco®V +uj(S>DV)) — Duj{V)~\-(jj{DV) for arbitrary 
vector fields V and tensor fields to £ Ti{M), I 

Recall that vector fields can be considered as derivations acting on func- 
tions, We show now that the commutator of vector fields can be gener- 
alised to arbitrary derivations. 

Lemma 2.4*3. Let D^D,D be derivations. Then the commutator 
[D,D] — DoD~DoD 



is also a derivation. Moreover ^ the Jacobi identity 

[D, [b, b\\ + [b, [D, b]] + [b, [b, d\] = o 



holds. 

Proof. For the first assertion we only need to check that the product rule 
is satisfied. This follows from 

D o D{(f <S) t/j) = D{D(p + D'il)) 

— Do Dip (g) ^ 0 Z) o Di/j -f- Dip 0 D^ -f- Dp 0 D'lj; 

and the fact that the term Dp 0 D^ + Dp 0 D^ is symmetric with 
respect to D and D. 

The second assertion is a special incident of a general property of 
commutators of the form AB ~ BA: The summands in 

[D,[b,b]] + [b,iD,b]],[b,[b,D]] 

12 3 4 

= DoDoD — DoDoD ~{D o D o D — D o D o D) 

5 4 16 

"" > ^ ^ "■ i ^ ^ 

-f D o D o D ~ D o D o D ~{D oDoD — DoDo D) 

3 6 5 2 

-i-DoDoD — DoDoD ~(i) oDoD — DoDoD) 

= 0 

cancel pairwise. I 



Corollary 2.4.4. For any vector fields V, W we have 
[U, [F, [U, V]] + [y, [W, [/]] = 0. 
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Proof, Clear since vector fields can be considered as derivations acting 
on functions. | 

Proposition 2.4,3. For vector fields U, V, we have £^u yj = [£jj, £y\. 

Proof, Clearly, both £^^^y^ and [£ij,£y] are derivations. By Proposi- 
tion 2.4.2 we only need to show that they coincide on functions and on 
vector fields. For any function / we have [£jj^ £y] • f = £ij£y • / - 
•^V^U = = = V]/- Hence the 

formula holds for functions. Corollary 2.4.4 implies for any vector field 
W 



[£jj, £y]W = £^{£yW) - £y{£ijW) = [?/, [V,W]] - [V,[U,W]] 
= [[U,V],W]^£p^y^W. 



The Lie bracket of vector fields does not only transform naturally with 
respect to diffeomorphism but there is also an especially simple relation 
if one considers smooth maps which are not necessarily diffeomorphisms. 

Proposition 2AA,^Let f : M N be a smooth map and V,W be vec- 
tor fields on M, IfVyW are vector fields on N with Txf{Vx) = ^/(x) 
and T^f{W^) — for all x ^ M, then the formula Txf{[V^W]) = 

holds. 

Proof Let y? : ^ R be a smooth function. The assertion follows from 

V{W{^ o /)) ^ V{{Tf{W){ip)) o /) ^ {T,f{V){Tf{W){ip))) o / 

= (V{W{<p))) O f. 



2.5 Differential forms 

While it is possible to avoid the usage of differential formSf they are 
such an important tool in analysis and mathematical physics that I 
have chosen to include them in this book. Differential forms will be 
used occasionally in the book, for instance in the treatment of electro- 
magnetism. 

The reader can skip this section on first reading but she or he is 
advised to read the motivation below. 

This section builds on the theory of antisymmetric tensors which 
is presented in Sect. 2.3.1 starting at page 77. 
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Differential forms are totally anti-symmetric covariant tensors. There are 
areas in mathematics and physics where this anti-symmetry proves to 
be of great importance. 

(i) Systems of partial differential equations: Recall that by the lemma 
of Schwarz the higher derivatives of a function commute. If one 
has a system of partial differential equations, any solution must sat- 
isfy this “integrability condition” . For the existence of a solution it 
is often sufficient to ensure that this integr ability condition holds. 
Since anything symmetric applied to something anti-symmetric van- 
ishes, such conditions can be naturally expressed by the requirement 
that certain differential forms vanish. 

(ii) Integration: Recall from linear algebra that the volume spanned by 
n vectors {6i, . . . , bn} in is given by the determinant ] det(jB)| 
where B is the linear map given by Bei — bi and {ei, . . . , 6n} is the 
standard basis of K^. As the determinant is totally anti-symmetric, 
differential forms are its natural generalisation. The lemma of Poin- 
care (Theorem 2.5.2) and the theorem of Stokes (Theorem 2.5.5) 
which unifies the classical integral theorems of Gaufi and Stokes are 
good examples for the superiority of using differential forms. 

(iii) Physical applications: There are also direct physical applications 
of differential forms. They are a prerequisitive for understanding 
gauge theories (cf. (Bleecker 1981)) of elementary particles and in 
particular the theory of electromagnetism (cf. Sect. 5.2.3). 

Recall from Definition 2.3.8 that the set A^M = Uaj€M A^(2® Af) of all 
p-forms is a vector subbundle of 

Definition 2.5.1. We denote the set of all differential forms of degree 
p by Q^{M) := {w € Tp{M) : altocu — u} (cf Definition 2.3J3). 

If M is a real manifold we will sometimes denote 17^ (M) by i?^(M, R) 
(cf. Remark 2.5.2 below) 

The definitions and properties of p-forms given in Sect. 2.3.1 carry 
over to differential forms in a pointwise manner. 

Lemma 2.5.1. Let 0)^7] be differential forms andV be a vector field, 

(i) For any smooth map <f): M N the exterior product satisfies 
(f>*{u At]) — 4>*uJ A 4>""r}, 

(ii) If (f: M ^ N is a local diffeomorphism then Jcj) = (^*u) J 
{(fAoj) holds, 

(iii) The differential form u can uniquely written as 

co(x) ~ ^ A ' - Adx^^A 

l<ii<‘“<ip<n 

where {x^^, . . ^x'^) is a coordinate system. 
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Proof. The proof follows directly from the definitions. I 

There is a simple formula which relates the Lie-bracket and the interior 
product. 

Lemma 2.5,2, Let lj be a differential form and U^V vector fields. Then 
the formula {U, V]Jlu = TjjiV Juj) - V J £jjuj holds. 

Proof. If u; is a 0-form then we have [U,V]Juj = 0 by definition. The 
right hand side vanishes for the same reason. If a? is a 1-form then we 
have by the derivative property of £jj 

[U,V] JU) = u{£ijV) = £if{ui{V)) - {£fju){V) 

— £jjiy Jw) — V A£jj(jJ. 

Assume that the assertion of the lemma holds for 1-forms and p-forms. 
For any (p -{- l)-form oj we find p- forms and 1-forms rf with u) = 

a Hence we get 

n 

[U,V]Juj^Y^[U,V]Jir]^Aoj^) 

= Y^ilU, V]Jr,^) [U, y] Jo;* 

i=l 

n 

^=1 

-ri' A{£ij{VJui*)-VJ£jjio^)'^ 
n 

= E ( -iv')) A - {V J £ijrt) A 

- rf A {£u{V aJ)) +ri^ a{VJ£jjlj*)^ 

n 

= E ^ J ^0 ^ 

i=l 

-Vj{{£jjr]^)Aw^) - {£jjr]^) AVJlo^ 

- £jj {rf A (y Jw*)) + {£jjn^)AVAu^ 

-V A{rt ^{£u^')) - iy A rf) A £jju:^ 

n 

= Y.{^u{VA{r^^Aur^))-VA {£uW A w*)) ) 

i~l 

— £ij{y J(jo) ~ V j £jju). 

Hence the assertion follows for arbitrary degree by induction, I 
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By far the most important construction for differential forms is the ex- 
terior derivative which will be introduced in the following theorem. 

Theorem 2.5.1. For each p e N U {0} there is a unique map 

d; QP(M) QP+\M) 

such that the following properties hold: 

(i) d is K-bilinearf 

(ii) d o d = 0 

(in) d{u Arj) — du (— A drj for all q-forms lo and r-forms 

(iv) For f € (i.e.f functions f:M~^K)df coincides with 

the usual differential 

Definition 2.5.2. The operator d of Theorem 2,5 J is called the exterior 
derivative. 

Observe that for the definition of the exterior derivative we do not need 
any additional structure. This fact indicates that in many applications 
it will play a fundamental role. In comparison, the Lie derivative of a 
tensor field is only defined with respect to a given vector field. 

Proof of Theorem 2.5,1. First we show that d is a local operator, i.e. , if U 
is an open set with compact closure and w, p G satisfy (jO\u = r\\u 

then {du))\u ~ (d^)|w- To see this let V be an open set with V cU and 
/i: M — ^ R be a smooth open function with /i|y = 0 and = 1. 

Since h{co — r/) = w - p we obtain from (iii) 

d(o^ — rj) — d{h{u — r})) = d/i A (o> — p) + hd{tj — 7])). 

This implies (d(o; - ri))\v = 0 since both dh and h vanish on this set. By 
the arbitrariness of V we have therefore proved (dw)|^ = {dr})^K. 

Since d is a local operator we can restrict to chart neighbourhoods. 
We will prove the theorem by showing that for each chart (ZY, <p) there 
is a unique operator d which satisfies properties (i)-(iv) above. Let uj e 
QP{M) and write w = a^h...ipdx^^ A - - • Ado;^’'\ Properties 

(i)-(iv) imply 



do> = d I ^ A • • • A da:**' 

= ^ . . .ip ) A da:*' A • • • A da:*’' 

l<ii<---<ip<n 

+ Wii..,i,,d(dx*' A - - • A da:*’')) 
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l<ii<'-'<ip<n 

V 

+ ^ A • • • A 

l<ii<‘”<V<n j~l 

A dda:^-^‘ A dx^’+^ A • • • A dx^^* 

— ^ A da:^^ A * • * A da:^^\ 

l<ii<-'‘<ip<n 

Thus we have shown that do; is uniquely defined if it exists. Furthermore 
this explicit formula also guarantees existence once we have shown that 
dw := Ei<ii<...<i„<n‘^(‘^n -v) ^ A • • • A dx^f satisfies (i)-(iv). 

Properties (i) and (iv) are clear. For (iii) we calculate for to G 
and rj € 

d(c<; A Tj) 

= d ^ ^ii.^ApVju^.jq A da;^^ A • • • A da:^'^^ A da:^^ A • • • A da;^^^ 
J2 <^^h-ipVh-3., A da;^^ A • • • A da:^^' A da;-^^ A • • • A da:-^‘^ 

1 ^ i ^ ^ ^ Tt 

l<ji<-^<jq<n 

+ ^ A drr^^ A ■ • • A A da:*^^ A • • • A da;-^^^ 

l<ii<“'<ip<n 

l<ji<‘"<j(j<n 

= ^ duJi^,^Aj, A da:^^ A • • • A dx^^^ A dx^^ A ♦ ♦ • A dx-^'^ 

l<ii <*-><?p<n 
i<ai<-*-<jy<n 

+ (-1)^ X] A da;** A • ■ • A da;*'’ 

X ^ 1 1 ^ ^ ij} ^ Ti» 

A dr;ji..,jy A dx-^^ A • • • A dx*^^^ 

= do? A ?7 + (—1)^0; A d? 7 . 



Property (ii) is follows from 



dd(cL?i^,...i^J A dx^^ A • • • A dx^^E 

l<ii<'>‘<ip<n 

and the fact that for any function / G C7°°(M, R) we have 



t,3~l 



-1=1 
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r\2 ^ 

where we have used that is symmetric in i and j by the lemma 

of Schwarz, I 



Remark 2,5,1, Notice that this definition of d coincides with Definition 
2.2.7 in the case of 0-forms. 

Corollary 2.5.1. If N is a second manifold and (j): M ^ N a smooth 
map, then we have = d{(f>*uj) for all p-forms u) on N, 

Proof, This follows since properties (i)-(iv) are obviously satisfied for 
(p*duj and since the exterior derivative is unique. I 



Corollary 2.5.2. Lie derivative and exterior derivative commute: Let 
cv e and V e Tq{M), Then the equation 

£ydco = dLyUJ 

holds. 

Proof. Denote the flow of V by Ft, Then we have d{Ftyuj = {Ft)*duJ 
and therefore 

I 

We wish to give a formula for da; which does not depend on a chosen 
coordinate system. The idea is to link the exterior derivative to the Lie 
derivative and the interior product. 

Lemma 2.5.3. Let u e f2^{M) and V e Tq(M). Then we have 
£yuj = V Jdca -h d(V Jo;) 

Proof, We prove the lemma by induction. If p = 0, then we obviously 
have y Jct? = 0 and £yto — V mu; = do;(y) V Jdto. Assume now that 
the assertion has been proved for all g 6 {0, , p} and let w € 

Since the formula which we want to prove is local we can restrict to a 
coordinate neighbourhood U and write 

(jj — ^ A • • • A = cui A dx^ 

il 
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where uji e f2^(U) are suitably chosen differential forms. Recall that £y 
is a derivation and that therefore 

£y(cji A dx*) = 1 ^ sign(<7p+i)<7p+i(wi ® dx®) | 

\ o-p+i€Sp+i J 

p! l ! ^ V sign((7p4-i)(7p+i (g) 

-\rUJi® £ydx^)) 

— £yuJi A dx'^ d-uJi A £ydx^ 
holds. On the other hand, we have 
V J d{coiAdx^) + d{V J {cui A dx^)) 

= VJ {duJi A dx®) + d((y Juji) A dx® + A (V Jdx®)) 

induction 

/ N 

= (y J duJi) A dx® +(-l)P+Mwi A (y J dx®) 

induction 

, 

+ d(y JuJi) A dx® +(-l)Pdo;i A (y Jdx®) 

* 

^ ^ .. 11 ^ 

+ (-l)P(-l)Pwi Ad(yjdx®) 

induction * 

S ,, ^ 

= £yiVi A do;^ + Cdi A £ydx^ 

where we have used d{V Jda;") = d(da;"(y)) = d{£yx'^) = £ydx^^ (cf. 
Corollary 2 . 5 . 2 ). | 

We can now use the preceding lemma in order to prove an invariant 
formula for the exterior derivative. 

Proposition 2.5,1. Let oj e 0^{M) and Vo,...,Vp be vector fields. 
Then the exterior derivative of lo is given by 

p 

dw(Vo, . - . , yp) = ^(-i)®i:y,(a;(yo, . . . , y, . . . , yp)) 

i=0 

0<i<j<p 

0<i<j<p 

where means that the corresponding vector field is left out. 
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Proof. The second equality follows trivially from the first equality. We will 
prove the proposition by induction, If a; € Q^{M) then do? (Vo) = Vo«c<; = 
£y^{(jj) which implies the equality in the case p = 0. Assume now that 
the assertion has been proven for g € {0, . . . ,p} and let co € 

Lemma 2.5.3 implies 

dw{Vo, Vp+i) = {Vo J do.) (Vi, . . . , Vp+i) 

induction 

= £v^iu{Vi , - d(yoJo;)(Vi,...Vp+i) 

p+i 

= , Vp+^)) -Y.^{Vi,..., Vi-i, £v^Vi, Vi+i,..., Vp+i) 

induction 

S 

P+1 

- ((^0 Jio)iv^, y,-i, y,+i, • • • , v^p+i)) 

induction 

E i-iy-^^'^-HvoJu>)i£vyk,vi,...', 

i<j<fc<p+i 

induction 

P+1 

- Y,i-'^y~M£vo^i, Vi, . . . Vi-i, Vi+i, . . . , y^+i) 

i=l 

p+1 

+ ^(-iy£y,(+v'o,yi,.. 

j=i 

+ Y1 i-iy-^’^uj{£vyk,Vo,Vu...yj,...,Vk,...,Vp+i)). 

l<j<k<p-\-l 



p+1 

= J2i-iy£v.ujiVo,...,Vj,...,Vp) 

0<j<k<p-^l 



I 
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2.5.1 The lemma of Poincare 

The lemma of Poincare is the generalisation of the following two facts 
familiar to physicists, 

(i) Every curl-free vector field has a local scalar potential and 

(ii) every divergence- free vector field has a local vector potential. 

The lemma of Poincare is a good example for the elegance of dijfer- 
ential forms. 

To give the reader a better idea how the lemma of Poincare arises we will 
briefly recall the introduction of scalar and vector potentials and then 
translate this discussion into the language of differential forms. 

(i) Let F: W C be a smooth map which satisfies rot(F) = 0. 

Then for every x £ U there is a neighbourhood V and a function 
/ : V ^ M with F\y ~ grad(/). In Mechanics this mathematical fact 
is applied to conservative force fields F, The function / is called the 
associated scalar potential 

(ii) Consider now a map V : W C whose divergence vanishes, 

4- = 0. Then for every x £U there is a neighbourhood 

V and another map VP : V such that rot(VP) = V. The map 

W is called the vector potential of V. Vector potentials arise for 
instance in the elementary theory of electromagnetism. 

In the language of differential forms, these (and similar) results can be 
treated in a simple, unified manner. To see this, we will briefly sketch 
two classes of isomorphisms which will be studied in more generality and 
more detail in Sect. 4,2. 

The standard scalar product of induces an isomorphism 

which maps vector fields to the 1-forms, V := (V, •)j^ 3 . The inverse 
map is denoted by jj: u; where = to. These isomorphisms 

are will defined since the scalar product is non-degenerate. With respect 
to the standard orthonormal basis this isomorphism just interchanges 
column and row vectors. 

The scalar product also induces an isomorphism between 0-forms 
and 3-forms and an isomorphism between 1- forms and 2- for ms. Let 
{x^^x"^,x^) be the standard coordinate system of and set 

★1 — dx^ A d.T^ A dx^^ 

T^drr^ = da;^ A dx^, 
i<dx^ = dx^ A dx^, 

'kdx^ — dx^ A dx^. 



This defines linear isomorphisms A'^(R^) —> yi^(R^) and A^(R^) 
yi^(R^). They can be extended to 2- and 3-forms by demanding ★★cj = to 
for all forms in R^. 
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Observe that grad(/) = d/^, rot(F) = (vcdF^)^, div(y) — 'k{d'kV^). 
Hence the (i) and (ii) above relating F and /, V and W translate into 
the assertions 

(i) If the 1-form satisfies dF^ = 0, then locally there exists a 0-form 
/ with d/ = 

(ii) If the 2-form satisfies d * V'’ =0, then locally there exists a 
1-form with dW^ = 

The lemma of Poincare generalises these facts. 

Theorem 2.5*2 (Lemma of Poincare). LetM be a manifold andu G 
J7^(M) with p > 1. If du ~ Of then for every x ^ M there is a neigh- 
bourhood U of X and a {p - l)-/<9rm 6 € such that = d6. 

The proof of the theorem will be a corollary to the following lemma. 

Lemma 2.5.4. Let M be a smooth manifold and define for any t G [0, 1] 
the map 

ii : M — ^ [0, 1] X M, X (t, x). 

There exists a linear map K : 1] x M) (q > 0) with 

doK + Kod = {iiY ~ {ioY. 

Proof For w e 1] x M) and t'l, . . . , € T^M we define 

KuJa;{vi,...,Vg)= f {it*{dtJu)(^t,x))){n,---,Vq)dt 

Jo 

Let Fi, . . . , Vq^i be vector fields on M. Then, using Proposition 2.5.1, 
we obtain 

dKw(Vu^..,Vq+i) 

q+l ^ 

= -l‘^(t,x))(n, .... K, , Vg+r) dt 

a=l 

l<a<b<q-\-l 

• • • ? Pa? • • • ? Pb? • • » ? P^+l) 

For any vector field V on M we will denote the canonical lift of V to 
R x M, given by (t, x) (0, V) G TfR 0 TxM^ by the same symbol V. 
In the following it is notationally advantageous to write Po for dt. With 
these notations and using [Pb,K] = [dt,Va] = [(^t? 0), (0, P^)] = 0 the 
{p + l)-form dFo; is given by 



dFo;(Fi,..,,F,+i) 
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4-r->- pi 

tt=l 

l<a<6<g+l ^ 

• • ♦ ? « j > ^+l) dt 

= - D-i)“ / • K-)(^o, Fi, . . . , K, . . . , y,+i) ) 

E (-1)“+*'/ 



dt 



0<a<6<g+l 



Analogously, Kdu is given by 






Kdu>(Vr, ..., Vg+i) = f' it*(dtJduj^t,.))(Vu • • • , V^+i) dt 
Jo 

= j dw(t a;)(Vo, Vi, . . . ,14+1) dt 
Jo 

j-i 9+1 

a=0 

0<a<6<g+l 

• • • ) • • • 5 Vb^ ■ • • } ^+1) dt, 

Taking the sum, d o iiT + AT o d, all terms with a > 1 cancel and we arrive 
at 

{do K K o d)u{Vi ^ . . . , Fg-fi) = / 9t • • • * j ^+1) dt 

= (ii)*o;(yi, . , . , T/,+i) - (to)*u;(yi, . . . , 



Proof of Theorem 2.5.2, Let (V, (p) be a chart with ip{x) = 0 and (p{V) D 
{(a;4 ...,x^): Ea=i(^“)^ ^ !}• Setting 

U:=ip-H{{x\...,x-): E(a:“)2<l}) 

a=l 

and F(t, y) (f~^{t'^T{y)) we have constructed a smooth map F : [0, 1] x 
U which satisfies F(0, y) = x and F(l, y) — y for all y ^U. 
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Since Fo%i is the identity on U and FoiQ the constant map y x^we 
have for any p-form a?, {F o = cu and {F o io^uj = 0. The assertion 
follows now from 

d o KF*U) -Ko dF^u = ilF^u) - = a; 

and dF*o; = F*do; = 0. I 

Observe that the proof of the lemma of Poincare allows us to calculate 
0 = KF*u> explicitly. 

Definition 2.5.3. A differential form uj e is closed «/do; = 0 

and is exact if there exists a differential from Q G with to = do). 

In this terminology the lemma of Poincare simply states that every closed 
differential form is locally exact. In general, this is not true globally. 
Corollary 2.5.3 below is a global version of the lemma of Poincare. 

Definition 2.5.4, Let M,N be manifolds and fff:M^Nbe smooth 
maps. /, / are homotopic if there is a differentiable map F: [0, 1] x M — > 
N such that F{0^x) — f{x) and F(l,x) = f{x) for all x G M. The map 
F is called an homotopy. 

A connected manifold M is called contractible if the maps id : M 
Mj X x and the constant map Cx^: M — > M, x ^ xq are homotopic. 

If xo,Xi are in M and 7: [0, 1] M is a curve from xq to xi then 
F(t, x) ” 7(f) is homotopy between the two constant maps Cxq and • 
Hence the following lemma implies that contractibility does not depend 
on the choice of Xq. 

Lemma 2.5,5. Homotopy is an equivalence relation which we denote by 



Proof The relation / / is clear since we can choose F(t, x) = x for all t. 

If / cf ^ and F is a homotopy between / and g then F(^, x) := F(1 — x) 
is a homotopy between g and / whence g — f. Assume that f — g and 
g h and let F, (respectively, F) be homotopies between / and g 
(respectively, g and h). Let cj): [0, 1/2] ^ [0, 1] be a smooth map which 
satisfies 



(i) <^(0) = 0, 

(ii) <^(1/2) = 1, 

(iii) = 0 for all fc > 1 

and rp{t) = 1 — ^(1 — t). Then the map 



F:[0,1]-.M, (t,x) 



fF{(^(t),x) if te [0,1/2) 
if [1/2,1] 



is a homotopy between / and /i, 
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Lemma 2*5.6» Let f^f:M^N be smooth homotopic maps and to e 
be a closed differential form of degree p> 1. Then /*o; — f*uj is 

exact. 

Proof, Let F be a homotopy between / and / and K: > 

be the operator defined in Lemma 2.5.4. Then we have 

dKF*uj = dKF*(v + KdF*tu = {i^F*uj - {iP)*F*u = f*cj - f*cj. 



Corollary 2.5.3. Let M be a contractible manifold, p > 1, and to G 
be a closed differential form. Then u) is exact 

Proof Let M M the constant map x xq- The differential form 
id^^n; — is exact by Lemma 2.5.6 and the assertion follows from 

id* C l; = U), {cxc,Y^ = 0. I 



2.5.2 The theorem of Frobenius 

The guiding idea in analysis is that it is much simpler to work with a 
linearisation of a function than with the function itself. Analogously, it 
is often much simpler to specify properties of the tangent bundle of a 
manifold than to describe the manifold itself. We are therefore interested 
in the following problem. 

Let F be a vector subbundle of TM. What are the necessary and 
sufficient conditions for the (local) existence of a submanifold 
N C M with TN = En^ 

If iV is a submanifold of M then TA" is a subbundle of TM which has 
the property that for any two sections U, V oi TN (i.e. any two vector 
fields along N which are at each point tangent) the commutator [U, V] is 
again a section of N, This follows since A is a manifold in its own right. 
On the other hand, not every subbundle E of TM has this property, for 
instance, take M = and E = {a{zdx + dy) -j- bdz : a,b e M}. For 
this vector bundle we have [zd^ + dy,dz] = ~dx ^ E{^x,y,z)^ Hence there 
cannot exist a submanifold A of M with TN — E^. This motivates the 
following definition. 

Definition 2.5.5. A vector subbundle E ofTM is called integrable if 
for any two sections U,V of E the Lie derivative [U, V] is also a section 
ofE, 

An integral manifold of E is a submanifold N of M with TN c E, 
An integral manifold A is called maximal ifTxN = Ex for all x e N. 
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The theorem of Frobenius (cf. Theorem 2.5.3 below) asserts that inte- 
grability is also sufficient for the local existence integral manifolds. This 
justifies the terminology. In order to verify integrability it is sufficient to 
consider a single frame for i.e, k linearly independent locally defined 
vector fields Vi, . , . , which at each point x span 

Lemma 2.5.7. A k -vector subbundle E ofTM is integrable if and only 
if there exists a local frame {Vi , . . . , 14} of E such that for all i,j the 
commutator [Vi^Vj] is a section of E. 

Proof It is clear that the condition is necessary. Let I/, V be any sections 
of E. Then there are functions ^0^ with U = aWi and V ~ 

ELi and 

[U, y] = ([/ • - y • a^)Vi + Vj] e E. 



Theorem 2.5.3 (Theorem of Frobenius, contravariant form). 

Let E be a smooth subbundle of TM. Then through every x G M there 
is a locally unique maximal integral manifold of M if and only if E 
is integrable. Moreover, A4 depends smoothly on x. 

The basic idea of proof is as follows. Let 14 , . . . T4 be a frame of E and 
be the submanifold swept out by the integral curve of V\ through 
X. At each y € iV^ we can consider the integral curve of 14 through y. 
All these integral curves together form a set subset of M. Now for 
any z e we take the integral curve of 14 through 2 ;. These integral 
curves form a subset of M and so on. One then has to check that 
really is a submanifold. This is not entirely straightforward because 
in general, the intermediate sets . . . , are not submanifolds. It 
can be shown, however, that the subsets . . . , are submanifolds 
of M if the frame {Vi, ^ . ,Vk} is carefully chosen. 

We will prove the theorem for an equivalent covariant form (cf. The- 
orem 2,5.4 below) using an analogous strategy. 

A /c-dimensional vector subbundle of M can be described as the inter- 
section f]^~i kern(o;^), where are some suitably chosen, at each point 
linearly independent 1-forms. In fact, let {Vn~k , « • • 5 Kz} be a frame for 
E and {I 4 , . . . , Vn-k] be a completion to a frame of TM. If . . . ,uj^} 
is the dual basis, then E — kern(o;^). 

Definition 2.5.6. A collection of finitely many pointwise linearly inde- 
pendent 1-forms {ijJi] is called a Pfaffian system. The Pfaffian system is 
integrable if the vector subbundle E = p| kern((u^) is integrable. 
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If a 1-form vanishes on a submanifold i.e., u)^{v) = 0 for all v € TAT, 
then dcc>, being intrinsic to should also vanish: do;^(u, u) = 0 for all 
u, u € TxN . This is equivalent to (do;^ A A • • • A = 0 which is 

the same integrability condition as before: 

Lemma 2*5.8. Letcu^^ . . . be 1- forms which are linearly indepen- 
dent at each point. Then the vector suhbundle E kern(u;^) is 

integrable if and only if dco^ A' • •Au}'*^~^ — 0 for alii € {1, . . . ,n— fc}. 

Proof We extend to a coframe {a;^, . . . of T*M and 

let {Vi, , , . , V^} be the dual basis. E is then spanned by {V^_/c+i, . . . , V^}. 
We calculate 



dLjHVk.Vi) = Vf,.uj\Vi) - Vi.uj\Vk)~uj\[Vk,Vi]) - -iv^{[Vk,Vi]), 

It follows immediately that da;^(yfc, V;) (i E {1, ~ k}) vanishes for 
all Vk, VieEif and only if [Vk, Vi] e E for all 14, Vi e E. I 



Theorem 2.5.4 (Theorem of Probenius, covariant form). 

Let be a Pfaffian system which satisfies dw"^ Aw^ A • * • A 

i^ri-k _ Q ^ {1, . , . , n — /c}. Then there exist coordinates (x^, 

. . . , x^) and functions with 

uT — , x’^)dx-^ {j = 1, . . . , n — /c). 

The manifold N ~ {y ^ M : x^{y) = x*^ (x)Vj e {1, . . . , n — fc}} is a max- 
imal integral manifold of E :— kern(c<;^). In particular, Theorems 

2.5.3 and 2.5.4 are equivalent. 

Proof of Theorem 2.5.4^ As outlined above, we will prove the theorem 
by induction over k. 

Let k = 1, Then E :— kern(cj^) is a one dimensional vec- 

tor subbundle spanned by a single, non- vanishing vector field V. By 
Theorem 2,4.3 there exist coordinates (x^, . . . ,x^) such that V = 

Since lo'^{V) = 0 {i — 1, ... ,n — 1), there must exist functions coj with 

w* = • I x‘^)dx^. 

We assume now that the theorem has been proven for A; = 1, , . . , fc~l. 
Let / be any function such that d/, , . . . , u'^~^ are linearly indepen- 

dent. Clearly, this system of differential forms satisfies the integrability 
conditions 



dd/Ad/Aw^ A--- = 0, 
dw* A d/ A A • • • A = 0. 



By our induction assumption there exist coordinates {x^,. . . , i”) with 
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d/ = Mx)dx^ + ■ • • + (2.5.3) 

w* = + • • • + 

We will now show that there is a PfafBan system which 

defines the same vector bundle as the original PfafBan system but does 
not depend on . . . , Equation (2.5.3)implies that / must be a 

function of :r^, . . . , only. We can therefore substitute one of the 

coordinate functions by /, say / = At each x e M the two sets 

of 1-forms {dx^, . . . ,dx^“^,d/} and {d/jCj^, . . . are each point- 

wise linearly independent and span at each x ^ M the same subspace of 
T*M. Hence there are functions with where 

(^j)i j=i n-k invertible matrix at each point. This implies that 
there are functions such that the differential forms = dx^i-h^{x)df 
span the same space as {co ^^ . . . Since the vector bundle E := 

kern(o;^) = kern(f?^) is integrable we obtain from Lemma 

2.5.8 

0 = df?^ A A • • • A 
= d/i* A d/ A A • • • A 
= d/i* A d/ A d^i A ■ • • A dx”~^. 

It follows that the functions (and therefore also the 1-forms de- 
pend only on . . . , (where we have used / = x'^~^~^^). 

Since the 1-forms , 1?^“^ do not depend on 

the PfafBan system {i?^^ . . . , can also be considered as a PfafBan 

system of the space parameterised by x^, . . . These 

forms define a 1-vector subbundle of At the beginning of 

the proof we have already established the assertion in this case, hence 
there are new coordinates (x^, . . . of and functions 

such that f?j(x)dxA These equalities also hold for the 1- 

forms considered on M since they only depend on x^, . . . , 

The assertion follows now from the fact that the 1-forms are linear 
combinations of the 1-forms I 



2.5.3 Orientable real manifolds 

This section is a prerequisitive for the following section on integration. 

We were led to the definition of a manifold by the localisation of the 
global concept of a vector space and have seen that a manifold can be 
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thought of as a collection of local which have been patched together. 
The example of a sphere shows that, globally, a manifold may be very 
different from In this section we give a (very primitive) global classi- 
fication of manifolds by dividing the collection of all manifolds into two 
classes. More sophisticated global classifications can be found in books 
on differential topology such as (Guillemin and Pollack 1974) or (Bott 
and Tu 1982). 

Definition 2.5.7. A real manifold M is orientable if there exists a 
nowhere vanishing n-form v on M. An orientation of M is the choice 
of one of the two equivalence classes {fv : / e \ {0})}, 

{~fu : / e C^(M, \ {0})}. An oriented manifold is an orientable 
manifold together with an orientation. 

Proposition 2.5.2. A real manifold M is orientable if and only if it 
has an atlas {{Uk,^k)]ken such that for all a, 6 € N and all x eUaCiUs 
the differential D (^(pa ^ : R’^ ^ has positive determinant 

Proof : Assume that is a nowhere vanishing n-form and let 

be a countable atlas. In order to simplify notation we renumber these 
charts such that for each k> 2 there is a j < fe with Vj O V/c 7 ^ 0. 

We set (Ui^ipi) — (Vi,'^i) thereby trivially defining an atlas for Vi 
which satisfies the positive determinant condition This atlas can be ex- 
tended by an induction argument. Assume that we have defined an atlas 



for the set Vi U • • • U Vfc which satisfies the positive determinant condition 
and let j € {1 , . . . , fc} be an index with V^+i n Vj 7^ 0. 

There is a nowhere vanishing function fj : (p{Uj) R with = 

fjdx^ A • • • A dx'^ and a nowhere vanishing function fk^i : (p{Vk+i) R 
satisfying {'ipk-\-i)*^ = fk~^idx^ A • • • A dx'^. Since neither fj nor /fc+i 
vanish on (pj(Uj) fl i/;fe+i(Vfc+i), we have either fj • fk-^i(y) > 0 or fj • 
fk-hi{y) < d ^11 y ^ ^ '^/c+i(Vfc+i)- in the first case we set 

{Uk+i,(pk+i) = {Vk+iyi>k-\-i) and fk^i = f^^i 
whereas in in the second case we set 

= (Vk+i,Xi^ 2 ^i^k-hi) and fk-^i = ~~fk-^i, 
where Xi^2 is the reflection defined by 
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In either case we have then = fk^idx^A- • ^Adx‘^ and fj-fk-^i > 

0. This implies 



det(D (^(fj o (cpk+i) dx^A • • • A dx'^ 

= {(fj o {(pk-^i)~^y dx^ A • • • A dx'^ 



fj 

/fc+i o {(pj o 



dx^ A • • • A dx'^ 



and therefore det(J9 {ipj o > 0 in either case. We still have 

to show that det(I> o > 0 for any % G with 

Ui f\Uk^i 7 ^ 0. For I e + 1} let fi be defined by ~ 

fidx^ A ” ’ A d.x^ (/ G {p^ g}). Since all fi (Z G {1, . . . ^ k} have the same 
sign and also fk~\-i and fj have the same sign it follows that sign(/i) = 
sign(/^-) sign(/fe+i). Hence det(D (cpi o > 0 

“<=r”: Let {gki M [0, l]}A:eN be a partition of unity subordinate 

to {Uk}k&i let 



^ ...dx"). 

k=l 

This is a smooth, well defined n-form since at each x only finitely many 
gk{^) are non-zero and the support of each gk is contained in Uk. Let 
X €hii and iij. . Ap all indices with pi- {x) ^0. Then 

V 

dx^ A • • ■ A dx" 

j=l 

does not vanish since all pi . are strictly positive and all 



have the same sign. 



Definition 2.5.8* Let M be an oriented manifold and v G 17^ (M) 
be a representative of the orientation. An oriented atlas is an atlas 
{{lda^^a)}aeA such that for each a e A there exists a strictly positive 
function Ida 

{<Pa)*l^ = ^'o,l -ndx^ . . . dx”, 

where (o:^, , . . , x'^) are the standard coordinates ofW'. A positively ori- 
ented chart is a chart which belongs to an oriented atlas. 
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Example 2 . 5.1 (Mobius band, continued from page 53 ). The Mobius 
band M defined in Example 2.1.2 is not orientable. Using the same 
notation as in its definition, the set Ui intersects U2 in two subsets, 
VV’4. = 7T"^((a,2a) X (-6,6)) and and W_ = 7T~^((0,a) x {-b,b)). The 
map D(<^i o (<^2)“^)|3. has positive determinant for all x G >V+ and 
negative determinant for all x G W_. Assume now that M is orientable, 
i.e., that there is a nowhere vanishing 2-form 1/ on M. Then there are 
nowhere vanishing fimctions 

A, 



with 



((¥>1) ^)* i' - fidx^ A dx^ and ((^2) ^)* v - Adx^ A dx^. 

From {{(p2)~^)*i^ = Adx^ A dx^ = ((951 o (<^2)“’^)* (Adx^ A dx^) we 
obtain that /2(x) =:= det(D (<pi o (ip 2 )~^) fi for all x G W+ U W_. Since 
the determinant det(D (ipi o (952)”^) changes sign we get a contradiction. 
Thus the Mobius band is not orientable. 

It is a simple but good exercise to actually build a Mobius band from 
paper and to verify using this model that there are closed curves along 
which there does not exist any continuous frame. 

2 . 5.4 Integration on real manifolds 

In this section, we restrict to K = R. This is necessary since we 
need to employ partitions of unity. See Remark 2.5.2 below for the 
integration of complex valued functions. 

One usually introduces integration as a method to deterniine the volume 
of an open, bounded region B c The main idea is as follows. We 
divide B into small parallel epipeds Bi. Each Bi carries a number f{Bi) 
representing the volume of Bi. Summing up all these numbers gives an 
approximation for the volume of B. Clearly, the function / which maps 
parallel epipeds into real numbers must satisfy certain properties. The 
most obvious property is that if we divide Bi into two disjunct parallel 
epipeds Ai,Ci with Bi = Ai U Q, we have f{Bi) ^ f {Ai) -h /(C^), 
at least if Bi is sufficiently small. Then, choosing an infinite sequence 
{{Bi^a}iei{a)}a fy € N) of such divisions we obtain a sequence of numbers 
{Sie/(a) fi^ha)}aeN which in most cases of interest has a well defined 
limit, the volume vol(B) of B. Linear algebra indicates the following 
choice for /. Let {ei, . . . , 6 ^} the standard basis of E^, { 0 ^, . . . , O'^} its 
dual basis, and 6*4, . . , , bi^n those vectors which span the parallel epiped 
Bi. The number f{Bi) ~ 6 ^ A - • . . . , bi^n) is then the Euclidian 

volume of Bi with respect to the standard Euclidean scalar product. This 
function clearly satisfies the additivity condition above. If one knows 
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how to determine volumes one can also integrate continuous functions 
: S — ^ R (for instance mass densities) by replacing the differential form 
A • • • A 6*^ with the differential form A • • • A 0^. 

Let us now turn to manifolds. The main problem here is that we do 
not have a linear space in which to embed the cubes. However, by now 
we are familiar with the idea of translating concepts to their infinitesi- 
mal counterparts. Since the tangent space was introduced as the linear 
approximation of the manifold, it is natural to place our parallel epipeds 
which use the linear structure of R^ into the tangent spaces rather than 
into the manifold itself. In other words, we divide M into small sets Vi 
such that each of these sets corresponds to a parallel epiped in Txi,M^ 
where Xi is a point in We cannot define a canonical volume because 
a general manifold does not have a preferred frame {£?i, . . . , This 
indicates that it is more natural to integrate n-forms directly than to 
define the volume of a set first. We will later recover the volume as a 
special case (cf. Definition 4.2.1). 

To simplify part of our discussions we will define integration for n- 
forms which are not necessarily smooth. 

Definition 2,5.9. Let M be a real manifold. We denote by f?^(M) the 
set of all continuous n-forms. 

We clearly have i?^(M) C Let (W, (p) be a chart and oo G 

be an n-form with compact supp(c<;) C U, Writing (p = (a:^, . . . , x^) there 
is a unique smooth function ooi...n with u) = o;i...nda;^ A • • • A dx'^. We 
define 

/ u— uJi,^,ndx^ A • - /\dx^ := / o . . . da;^, 

j{Uyip) J{U,ip) 

where the last expression is the usual integration in R’^. We still have to 
show 

(i) that ^ CO does not depend on the chosen chart, 

(ii) how to extend this local definition to manifolds which may not be 
covered by a single chart, 

(iii) how to extend this local definition to n-forms which do not have 
compact support. 

(i): Let (W, ip) be another chart, denote by . . . , y^) the corresponding 
coordinate functions and set 
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Qrpa 



o'ij;dy^ A ‘ • A dy'^ 



and therefore 

/ ...dy” 

r 



= / ‘ 

J^{U) 

“’X 

“"I 



dx^ 






^i...n o . . . da;’^. 



dy^ ... dy” 

o o((po ip)~^dx^ . . . dx" 



v(U) 

Hence the definition is indeed coordinate independent if one fixes an 
orientation in advance and restricts to an oriented atlas. 

In order to address (ii) and to define integration globally we will 
employ a partition of unity. 

Definition 2.5.10. Let M be an oriented n- dimensional real manifold 
and cv € f2^{M) be a n-form with compact support Then 






where {{Vlb^^b)}peB oriented atlas such that each Ub has compact 
closure j fa a partition of unity subordinate to {Ub}b^B cb’^d 6(a) is an 
index with supp(/o,) C W{>(a)- 

Since each x £ M has a neighbourhood which is intersected by only 
finitely many supp(/a) and supp(o;) is compact the sum in the definition 
above is finite. 

Let {gc}cec be another partition of unity subordinate to {{Ub}h^B 
and 6(c) be an index with supp(^c) C Since all sums involved are 
finite we can calculate 



Y.3cfa 

= EE/ 

a€A c€C 

= EE/ 



U) 



/>(«,) ) 



9c fa^ 



c€C a€A ^(^h(c)^^h(c)) 



9c fa^ 



(VI 



_ dc faVJ 

ceC ^ ^^Kc)^^Ha)) aeA 
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cec '^(^ft(c)»^r>(c)) 

This implies that the definition is independent of the chosen partition of 
unity. Since by coordinate invariance it is also independent of the chosen 
oriented atlas our definition of integration over n-forms with compact 
support is well defined. 

We will now address point (iii). If a; G does not have com- 

pact support, its integral may not exist. This is completely analogous to 
the integration of functions / : M — > R. For our purposes the following 
extension is sufficient. 

Definition 2.5.11. Let M be an oriented n- dimensional real manifold 
and (jj € Let {^a?^a)}a€A be an oriented atlas and for each a 

let Ua be defined by A • - • A dx^. The 

modulus of oj is the continuous n-form \ca\ locally defined by ~ 

A ■ ■ • A dx'^. 

Let M be an oriented n-dimensional real manifold and (^a 7 9?a)aeN be 
a countable oriented atlas such that each Ua has compact closure. As a 
preparation to the following definition we first need to give a meaning to 
the expression for arbitrary a; € 0^{M) which do not 

necessarily have compact support. For each j € {1, . . . , A;} let Ij = G 
{j + 1 , . . . ,fc} : UjCMJi ^ 0 }. if 77 G has compact support 

then we have ’J = f(UjnUu^j)v)- If 

cu G is a general differential form we define 




Definition 2.5.12. Let M be an oriented n-dimensional real manifold 
and {Ua^(fa)aen be a countable oriented atlas such that eachUa has com- 
pact closure, Then-formu G is integrable if the (monotonically 

increasing) sequence 

{/ |w|}fegN= Tl I 

JUiU-UUk aeAk 

where {f^ : Ui \J • - U Uk} a partition of unity subordinate to {{Uh]h^B 
and b{a) is an index with supp(/a) C tlb{a)' bounded. 

Clearly, any uj G (M) with compact support is integrable. The defi- 
nition is independent of the chosen atlas. To see this let (Vb^'ipb)ben be 
a second countable oriented atlas such that each Vb has compact clo- 
sure. Then for each fc G N there is a j{k) G N such that UiU . . ,Uk C 
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Vi U . . . Vj(fe). Hence - /viu-uv^t^, ^Mch implies that 

{/wiu -uwfc is bounded if {/yju...uv,- is bounded. 

We can now define integration for differential forms which may not 
have compact support. 



Definition 2*5*13. Let M be an oriented n- dimensional real manifold 
and {Ua^^a)a€N CL Countable oriented atlas such that eachUa has com- 
pact closure. For any integrable u) € we define its integral by 

/ u) = lim 
Jm 

This limit is well defined since integrability implies that 



/ 



1 / 

JUiU-‘UUk 



I 

JUiU--’UUk+i 




for fc 00 and therefore that ^^}a:€N is a Cauchy sequence. 

Observe that we can integrate any integrable n-form over any subset 
open subset U and any (not necessarily integrable) n-form over any open 
subset V with compact closure since these forms are clearly integrable 
with respect to the (open) submanifolds U and V. 

The following two lemmas are direct consequences of our definitions. 



Lemma 2.5.9 (Linearity of integration)* Let M be a real manifold 
and io^u) G f?^(M) differential forms which are integrable. Then we have 
for any real numbers a, b 





U). 



Lemma 2.5.10. Let M^N be oriented real manifolds and f:M^N 
be a diffeomorphism such that f^^p is a representative of the orientation 
of N for each representative v of the orientation of M. Then we have 
for any a; € a; € (M) 




/ 

JN 



to. 



Complex valued functions play a very important role in functional 
analysis and quantum mechanics and vector valued differential forms 
are used in both physical and mathematical gauge theory. (For in- 
stance^ the analogue of the electromagnetic field strength is described 
by a Lie algebra valued differential form.) Readers interested in in- 
tegration over such objects can find some elementary definitions in 
the remark below. In this book, however, we will use these concepts 
only in order to motivate the definition of mean curvature vector field 

(cf. Definition 4 ’4’^)' 




2.5 DifFerential forms 117 



Remark 2,5,2, We have restricted to K = E since partitions of unity do 
not exist in general over complex manifolds. It is possible however to inte- 
grate complex valued differential forms over a real manifold M. Let M be 
a real manifold. A complex valued differential form is a map uj: x iVx 

p entries 

/ ' N 

where lox’ TxM x ^ ^ • x TxM C is multi-linear and anti-symmetric. 
Denote by /2^(M,C) the space of all complex valued differential form 
of degree p. For each to € C) there exist two uniquely defined 

differential forms e E) such that u = We call 

io € Q'^{M^ C) integrable if both and are integrable. The integral 
over an integrable complex valued n form a; is defined by 



f 00= ! a;"® -f i / a;''' 
Jm Jm Jm 



We wish to show that integration over complex valued differential forms 
is C-linear. Additivity is clear. With to € C) and a = E 

C we have 



f acxj — f {a^' 
Jm Jm 






Jm 

= (a^® [ w"® - + i(a^® f -h [ a;"®)) 

Jm Jm Jm Jm 

= (a^® Hh [ (u;"® 4- 
Jm 

= a Lo, 

Jm 

This definition can be further extended to vector spaces. Let K = E 
or K — C and 2J be a fc-dimensional vector space over K. A vector valued 
differential form of degree p is a smooth map uj\ Ux where 

p entries 

. s 

: TxM X • • • X TxM -> 

is multi-linear and anti-symmetric. Denote by the space of 

all vector valued differential form of degree p. If cj € and 

{t)i, . . . , t)/g a fixed basis of there exist k uniquely defined differential 
forms € Q^{M) with lo = A vector valued differential 

form uj is called integrable if all are integrable. It is clear that the 
definition is independent of the chosen basis. We can now define the 
integral of a vector valued differential form uj via 
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This definition is independent of the chosen basis since for any other 
basis . , . , with we have uj - where ~ 

therefore 




We turn now to the theorem of Stokes which generalises the classical 
integration formulas of Gaufi and Stokes which in turn generalise funda- 
mental theorem of calculus, f\x)dx = f{h) - /(a). 

First we need to define the concept of a manifold with boundary. 

Definition 2.5.14. The pair (M, 9M) is an n- dimensional oriented 
manifold with boundary if there exists an n- dimensional oriented man- 
ifold M and an embedding t: M M such that 

(i) the topological boundary di{M) is an (n — 1)- dimensional sub- 
manifold of M which is diffeomorphic to dM, 

(a) for each x G do{M) there is a positively oriented chart (U^ip) of 
M centered at x with n i{M)) = {y € <p{U) : < 0}, 

An oriented manifold with boundary is compact if i{M) U dt{M) C M 
is compact 



We will usually identify M with i{M) and dM with di{M). 

Let M be a manifold and N a hypersurface in M such that M = 
M \ N is connected. Then (M, N) is not an oriented manifold with 
boundary, even if M is oriented and compact. The following theorem of 
Stokes would not hold for (M, N). 



Theorem 2.5.5 (Theorem of Stokes). Let (M, 5M) be an oriented 
compact real manifold with boundary and uj 6 Assume that 

do; and u) are integrable over M and dM, respectively. Then Stokes^ 
formula 




dto 



holds. 



Proof Let {Uj , pj}jeN be an oriented atlas of M such that for each j there 
are intervals (aj, 6]), . . . {aj, b^) with PjiUj) = (a], 6]) x • • • x (a^, b^). 
The definition of a manifold with boundary implies that we can also 
assume that for each Uj which intersects dM the equality gjj(pijf\M) = 
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{y € ^ipi) : t/^ < 0} holds. Let {fj}j € N be a partition of unity 
subordinate to 

We show first that 



/ d(/jO;) = /* /j 

JMnu.i JdMnUi 



u 



(2,5.4) 



holds for each j e N. We may write fjio ~ A * • • A dx'^ ^ A 

dx^"^^ A- • • Adx’^ which implies d{fju) = A- • ♦ Adrr’^. 

We set 



n vn\ ^ -on-l 



[aj,bjY = (aj, 6j) x . . . (a}, 6}) • • • x (a^, 6^) c 



where indicates that the ith interval is omitted. Since has 

compact support in (aj, 6j) x • • • x (aj,b'j) the left hand side of Equation 
2.5.4 is given by 



/ d(/jw) 

JMDUj 

JMnUi ^ 



f y(-l)*+i^dx^..da;" 

J(a^.X^.)y^-^y(n.V-.hV\ ^ dx'^ 



/(a5,6))x...x(ay,6y) 



-/ 

J(aiM)x- 









drc^ . . . da:^ 





i=:2 


,b%...,xn 




=0 


A 







..W'-.-dx" 




=0 








...,x”))da;2...da:” 




=0 =0 


if Uj intersects 9M, 


4.,- 


,6,-]i ( , a;”) - wi (a] , 


. . . , jdx^ . . . dx'^ 






otherwise. 



There are three possible cases. 
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(i) li M C\Uj =0 then dM f\Uj = $ and both integrals vanish. 

(ii) If M n ^ 0 but dM C\lAj = % the right hand side vanishes 
because we integrate over an empty set. The left hand side vanishes by 
our calculation 

(iii) Assume that M nUj and dM n Uj ^ 0. Then 



/ d(/jo;) = f o;{(0,x^,...,x”)da:^...dx” ■ 

J(aj,bpx-x(a},bf) 

= / x”)dx^ A • • • A dx” 

JdMnUi 



-L 



dMnUi 






since the pull back of dx^ to dM vanishes and therefore also the n — 1- 
form 



o;j(0, . . . , A • • * A dx'^ A • • • A da:”' pulled back to 8M{i > 2). 



We conclude the proof by summing over all local integrations: 




%L 



dMnUi 



fjU) = 




UJ. 



Corollary 2.5.4. Let M be an n- dimensional^ oriented, compact, real 
manifold and co G Then da; = 0. 

Proof, Since M is a manifold without boundary, dM ~ 0 and duj ~ 
IdM ^ /0^ = 0. I 

As an application of Stokes’ theorem we prove that for every even- 
dimensional unit sphere every vector field must have a zero/ This theo- 
rem is also known as the ^‘theorem of the hedgehog” since it shows that 
it is impossible to perfectly comb an “ideal” hedgehog. 

Lemma 2.5.11. Let M be an n- dimensional, oriented, compact, real 
manifold and f,f:M-^M homotopic maps. Then 

[ [ ruj 

Jm Jm 

for alluJE 

^ Recall that we only consider smooth vector fields 
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Proof, Lemma 2.5.6 implies that there exists a differential form Q E 
with f*co - Pu) ~ do. Hence the assertion follows from Corol- 
lary 2.5.4. I 



Theorem 2.5.6. Let S'^ be the unit sphere in and V be a vector 
field on S'^, If n is even then there is a point xq € S'^ with V{xq) =0. 

Proof, If V{x) 0 for all x E we can normalise V with respect 
to the Euclidean scalar product (•, of i.e., we can assume 

without loss of generality that {V^V)^r,+i = L For each x e S‘^ we 
can identify V{x) with a tangent vector of and, since 

and are canonically isomorphic further with a point in R^+^. Our 
normalisation implies then that we have defined a map V \ S'^ S'^, 

Let 7 be an integral curve of V with 7(0) = x. Then {'y{t)^'y{t))^„,+i = 1 
implies 0 = (7(i),7(<))R»+t)|i=o ^ 2 (x, F(x))ji,.+i. Hence V{x) is 

perpendicular to x and the homotopy 

F: [0,l]xS^^S^ 

X t-> cos(7rt)a; -1- sin(7rt)y {x) 

is well defined. It satisfies F{0,x) — x and F(l,o:) = ~x for all x e S'^, 
Since —id is homotopic to id and the diffeomorphism —id changes the 
orientation of 5^ (for n even) Lemma 2.5.11 implies that for every uj E 

f (jo — — f (— id)*o; ~ — f oj, 

holds. This in turn implies a; = 0 for all n-forms to which is certainly 
not true. Hence our initial assumption V{x) ^ 0 for all x E S'^ must be 
wrong. I 



2.6 Connections and projective structures 

There is one feature of which we have ignored so far in our efforts 
to localise spacetime. Given any two different points in hP there is a 
unique line passing through them. This global structure has an infinites- 
imal counterpart given by the directional derivative of vector fields. In 
fact, these lines are exactly those curves 7: [a, 6] which satisfy 

07(7) = 0. (Observe that this expression is well defined, i.e. for any 
vector field V with V(j(t)) — j(t) we have 0^(7) = 0.) 

Recall that in Chap. 1 we have relied on this affine structure in order 
to introduce inertial observers. Here we will introduce a generalisation of 



p. 89 i 
li P. 125] 
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it to general manifolds. This will be done by generalising the directional 
derivative D. 

In the affine space the difference of a derivative of a map 'tjj: 

and a vector field 1/ : A^ ^ is blurred. In fact, let {ei, . . . , Cn} 

be the standard basis of K^. Since 

- the derivative of ?/? in direction u? at cc is given by or + ^w^ei and 

- the derivative of V in direction ti; at a; is given by 

it is difficult to see the difference between both kinds of derivatives. 
Accordingly, they are commonly both denoted by D. Consider now a 
map (j): M ^ M and a vector field on M. The derivative of <f> at x in 
direction is now given by whereas the analogous 

derivative of V is given by TxV(wx) 6 Ty(^)TM. We do not obtain a 
vector but an element in the tangent bundle of the tangent bundle. It 
follows that this derivative cannot be used for defining straight lines. 

In order to obtain an analogue of the directional derivative with val- 
ues in TM we will need an additional structure, a connection. The follow- 
ing definition is an expression of the idea that infinitesimally a connection 
V should be the same as the usual derivative D. 

Definition 2.6.1. A covariant derivative or connection V is a map 

V : To^(M) ^ V ^ W, VV{W) =; 

such that for all vector fields 17, V, W, all functions /, /i, and all ce, /3 G R 
the following holds. 

(^) fV -h hW^ = /Vyt/ + hV^rU , 

(ii) pV) — + |3V^rV f 

(in) VwfU - {W . f)U + fVwU. 

The torsion of V is the tensor field 

(17, V) ^ Tor (17, V) = - VyU - [[/, F]. 

A covariant derivative V is called torsion- free if in addition to (i)~-(iii) 
the equation 

(iv) Tor = 0. 

holds. 

This definition is justified by the following theorem. 

Theorem 2.6.1. A map V: Tq(M) — ^ Ti(M) is a torsion-free covari- 
ant derivative if and only if for each x e M there exist coordinates 
(x°, . . . centered at x such that 
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fdV^\ 

holds at X for all vector fields V, 



(v,y^) 



k ■ 



(2.6.5) 



Proof, Assume first that there exist coordinates such that the Equation 
2.6.5 is satisfied. Then it is clear that properties (i)-(iv) of a torsion-free 
covariant derivative are satisfied. 

Let V be a covariant derivative and (U^(p)^ be two charts 

centered at x. We denote the coordinates with respect to these two charts 
by x*(j/) = x*(y) = Observe that (D( 9 ? o ^ and 

Setting = F^bdx- and we have 

v^y = w^dx^v^dxc + r^bdxoW^V’dxc 
= Ty^^-y^a^c + r^bdx'^w^v'^dxo, 



where V°’ = W°' = and Sja = ^d^h. Hence we obtain 



(V^y) = Ty“a 5 » y<= + r^bW‘^v'> 






as- a*- ,y-v/ 



- A 

dx^ dxf 



dx^ dx^ dx^ 
dx^ dx^ 

:— py<^aay/+^ 
dxf * 



dx^ aS“ dxf dx^ 
dx“ dx'^ 



w^vf 



\dxfdx^ dx^ dxf 

^ (w’^d ^ + ^^r\] w<^vf] 

dx^ \ “ ax'* \ dxf dx‘^ dx^ dxf “ / / 



FVom = If {VwVy = ^W^d,.V- + F^^W^Vf we get 

therefore 

_ dx^ d^x^ dx^ dx^ dx^ 
dx^ dx^ dx^ dx^ dx^ dx^ 

Now we can show that a covariant derivative is torsion-free if and only 
if there exist coordinates such that the equation in the theorem holds. 
Expressing the condition of being torsion-free in coordinates we see that 
it is equivalent to in any coordinate system 

If the covariant derivative is induced by coordinates in which the 
vanish at x, then in any other any coordinate system (rr® , • • • ? ^ ) we 

have Pj^ — — IItt which is clearly symmetric in e and /. For the 
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converse, assume that the r°(, are symmetric in a, b and (without loss of 
generality) that 5“ (a;) = 0. We will now consider a quadratic coordinate 
transformation of the form = x“ + where is symmetric 

in b and c. At x“ = 0 we have then 









_ AC 

dx^dx^ 



whence — F^^ + at =0. Our assertion follows by choosing 

Kb ^ - {f'ab) I 



For later reference we collect the coordinate expressions derived in the 
proof of Theorem 2.6.1 in the following corollary. 

Corollary 2.6.1. Let be a local coordinate system on a 

manifold with connection (M, V). Then there exist functions F^^ = F^^^^ 
suck that for each vector field V the covariant derivative \7V is given by 

and the function transform under a coordinate transformation 

according to 

pa _ ^^ 3 ;^ dx^ dx"^ dx^ . 

dx^ dx^dx^ dx^ dafi dx^ 

Lemma 2.6.1. Let (M, V) be a manifold with connection. There exists 
a unique extension of V to general tensor fields, 

V:7;'-(M)->r4i(M), rP^Vi> 

such that Vy: TJ'(M) Tf^{M) is a derivation for every tensor field 
V, 



Proof It is clear that we can extend Vy to tensor fields as a derivation. 
Uniqueness follows directly from Proposition 2.4.2. I 



In special relativity, timelike straight lines represent freely falling parti- 
cles and lightlike straight lines represent light signals. In Euclidean space, 
straight lines are the shortest curves between any two points. One way 
to define a straight line 7: (a, 6) i~^ is to require that the accelera- 
tion 7 = 07(7) vanishes. This definition carries over to manifolds with 
connections as follows. 

Let t V{t) he B, vector field along a curve 1 7(t). Then we can 
extend U to a vector field V which is defined in a neighbourhood of the 
path of 7. It follows from Corollary 2.6.1 that the covariant derivative of 
V restricted to 7 in direction 7 is independent of the extension V. Hence 
the following definition makes sense. 
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Definition 2.6.2. Let V he a covariant derivative, t ^ 7(t) a curve, 
and t ^ V{t) a vector field along 7. 

(i) Then V{t) = the 

covariant derivative of V along 7. 

(ii) A pregeodesic is a curve 7 satisfying V^7 || 7. 

(Hi) A pregeodesic j is a geodesic if^A^j = 0. 

(iv) A geodesic is complete if it is defined for all t gK. 

(v) A projective structure is a maximal equivalence class of connec- 
tions which all have the same pregeodesics. 

The notation (t) can be justified as follows. Let W be any vector 

field with W(y{t)) = V{t). Then we have 

(V^(i)W^) o 7 = ((/06W“ + n^crw<^)da) o 7 

= (l(H/0 7)“ + r,“7w<=0 7)a„ 

= (^^“ + n“7V^)0„. 

Hence the notation comes from identifying V and W which are different 
maps but assign to each point j{t) the same vector. 

We will now introduce the notion of parallel transport The follow- 
ing motivation may seem to be mathematically imprecise. Nevertheless, 
mathematicians used such arguments (before the advent of the French 
Bourbaki school which introduced a new level of precision in mathemat- 
ics) in order to introduce connections. These arguments capture very well 
some of the intuition which leads to the notion of a covariant derivative 
and are therefore worth knowing, even though they have to be taken 
with a grain of salt: One may view a covariant derivative as a connec- 
tion between infinitesimally neighbouring tangent spaces. Let V be a 
vector field, x e M and ^ € T^M. The idea that T^M is the infinitesi- 
mal approximation of the manifold M near x is sometimes expressed by 
stating that x + ^ is a point infinitesimally close to x. (Strictly speaking, 
this “addition” of points and vectors in general manifolds does not make 
sense. Assume that M is a submanifold of Then we can identify 

TxM with a linear subspace of and a: + C <^oes make sense. This 

point will in general not lie on M but still be close to M if ^ is small.) Let 
Vx 6 TxM which we want to “parallelly” translate to the point x 
We will then have Vx+^ — Vx Sv where dv is small. Since this is an 
infinitesimal process Sv should depend linearly on v and the difference 
vector This bilinear map connects the tangent spaces of our infinitesi- 
mally neighbouring points. In coordinates, we have Sv^ = Con- 

sider now a curve 7 between two distant points x,y e M. We can divide 
the curve into infinitesimal segments and parallelly translate a vector 
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Vx e TxM successively along these infinitesimal curve segments. The 
concatenation of these infinitesimal parallel translations gives a linear 
map P^: TxM — ^ TyM which in general will depend on the interme- 
diate tangent spaces and therefore on the curve 7 . These ideas will be 
made precise in Definition 2.6.3 and Proposition 2,6.1 below. 

Definition 2.6.3, Let (M, V) be a manifold with connection and 

7 : (a, b) M 

be a curve. A vector field V along 7 is called parallelly transported 
along 7 ifVjV{t) = 0 for all t € (a, 6 ). A parallelly transported vector 
field is often simply called parallel. 

In particular, a geodesic is a curve whose tangent vector is parallelly 
transported. For this reason, geodesics are sometimes called auto-parallel 
curves. 

Proposition 2.6.1. Let (M, V) be a manifold with connection and let 
j: [a, b] M be a curve which can be smoothly extended into both di- 
rections. The map 

P^: T^(a)^ ^7(6)^) v\-^V{b), 

where V : [a, b] — > TM is the unique parallel vector field along 7 with 
V (a) = V, is a linear isomorphism. 

Proof. Since 7 ([a, b]) is compact, there exist finitely many charts which 
cover the curve 7 . Without loss of generality we can assume that 7 
is contained in a single chart. (Otherwise we could divide 7 in seg- 
ments which are contained in single charts. A successive application 
of the proposition to these segments would imply the proposition in 
the general case.) The equation =0 Vt € (a, fe) reduces to 

a system of first order differential equations for the coefficients V^{t): 
ya pa^^byc _ j|. foPows from the fundamental theorem for ordinary 

differential equations (cf. Theorem 2.4.1) that the map P^ is well de- 
fined. Letting j(t) j{a -\-b — t) and V{t) ~ V(a^b~t) we clearly have 
VjV{t) = 0 if and only if V^V{t) — 0. This implies that = P^ 

and a fortiori that P^ is an isomorphism. I 

It is possible to recover the covariant derivative from parallel transport. 
The relation between these concepts is similar to the relation between 
the Lie-derivative and the Lie-transport. But note that there is no Lie- 
transport along a single curve. 
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Proposition 2 .6.2. Let (M, V) be a manifold with connection and let 
j: [a, b] M be a curve which can be smoothly extended into both di- 
rections, For each s e [0,b - t) let • [0? s] a ^ ^{t s — a). 

Then for any vector field V (t) along 7 we have 

Proof Let W{a) = ^ + s)). Then = 0 implies — 

Using a Taylor expansion it is easy to see that there 
exists a smooth vector field U along with 

W{a) = W{0)-ar,^,W'={0)^l^-^%fj^ J da + U(a)a\ 

Setting <7 = s we obtain from 1U(0) — V{ti-$) and 
the equation 

+ 5) - V{t) = V{t + s) - y(i) + srsy{t)Y{t)da - u{s)s‘^ 
which implies the assertion. I 

Corollary 2.6.2. A connection is uniquely determined by its parallel 
transport. 

Since inertial observers played an important role in Chap. 1, a thorough 
understanding of geodesics and projective classes should be important for 
the globalisation of the results in Chap. 1 (cf. Chap. 3), But they are also 
of independent geometric interest. In fact, the classical development of 
Euclidean geometry is built on the concept of straight lines (and therefore 
on the concept of projective classes). 

Theorem 2.6.1 implies that torsion- free connections are a straightfor- 
ward globalisation of the usual derivative of vector fields in vector spaces. 
Proposition 2.6.3 shows that also from the viewpoint of geodesics it is 
sufficient to consider only torsion- free connections. | p. 15^ 

Lemma 2 .6.2. Let V, V be connections on a manifold M, Then their 
difference V ~ V is a tensor field. 

Proof. We have to show that (U,V) ^ S{U,V) = VjjV - VjjV is 
function-linear. The only non-trivial part of this assertion follows from 

S{U,fV)^WuifV)-^uifV) 

= {U • f)V + fVuV - {U . f)V - fVjjV 
==f^U^-f%V = fS{U,V)- 
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Proposition 2.6.3. Let V be a connection. Then there exists a unique 
torsion-free connection V which has the same geodesics as V. 

Proof. We define VyW = VyW - ^Tor(y, W), where Tov{V^W) ~ 

VyW “ - [V, W] is the torsion tensor of V. Since Tor(W, W) = 

0 for all W, both connections have the same geodesics. Further, V is 
torsion- free by construction. For uniqueness note that if we add any 
additional, non- vanishing, in the covariant entries skew symmetric tensor 
field S e V we loose the property Tor = 0. On the other 

hand, for any non- vanishing, in the covariant entries symmetric tensor 
field S e T^i^) there exists a vector with S{vx,Vx) ^0. This implies 
that the geodesics with respect to the connections V and V T 5 which 
have initial velocity Vx do not coincide. I 

However, there exists infinitely many torsion- free connections with the 
same pregeodesics. This means that each projective class contains in- 
finitely many torsion- free connections. 

Lemma 2.6.3. Let V and V be torsion-free connections and ^ be the 
projective structure generated by V. Then V G if and only if there 
exists a one- form 9 such that V — V = 0 0 id -I- id 0 0 (orj in coordinates, 

Proof Let - P^^, Then their pregeodesics can coincide only if 

II v°' for all vectors This implies E^^v^v^v^ ~ Ef^v^v^v^ = 0 
for all vectors v which is equivalent to Since E is 

symmetric in the lower indices h, c we get 

^bcd * ^{d^hc) 

= I ( {sts^cd - ) • 

Contracting the indices e and b gives 5K == nEf. - Ef^ -h ET - b^E^i. -f 
= (^ + l)^cd - where ^d (n T It follows 

that r-, = 

For the converse notice that VyV — VyV + 6{V)V, whence both 
connections have the same pregeodesics. I 

The following corollary will be used in Sect. 3.2 

Corollary 2.6.3. Let (M, V) be a manifold with connection. Let V be 
a connection such that for every x e M there is an open set iXx cTxM 
such that the pregeodesics with initial velocity Vx € ilx with respect to V 
and V coincide. Then V and V generate the same projective structure 
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Proof. We use the same notation as in the proof of Lemma 2.6.3. It is 
sufficient to note that = 0 for all vectors u € il^c 

already implies =0. I 

Written down in coordinates, the geodesic equation 7^ + = 0 

reduces to a system of second order differential equations on M. Alter- 
natively, it can be considered as a system of first order equations on the 
tangent bundle TM. In coordinates, this system of differential equations 
is given by 

= = -'’ft"*’''- 

The corresponding vector field on TM is called the geodesic spray P C 
Tq{TM)^ and can be invariantiy defined by 

P(vx) To7t;3; (5t)? 

where 7^.^, is the unique maximal geodesic with 7^,,, (0) = x and (0) = 
Vx^ The following proposition justifies the definition. 

Proposition 2.6.4. Let (M, V) be a manifold with connection and P 
he the geodesic spray. If X: TM is an integral curve of P, then 

tttm o X is a geodesic. Conversely^ for every geodesic 7 there exists a 
unique integral curve X of P with tctm o A = 7. 

Proof Let G TxM and 7^^, be a geodesic with 7(1^0) — Vx- This geodesic 
defines a curve TM. Clearly, tttm ° A^^ = 7^,.,. and 

^ 'I'tXv^idt) = Ttjv^X^t) = To^(7v:,; o'7t)(^t), where n is the 
translation s t + s. Since ° T-t)(0) is the velocity vector of 7^^ 

at t, A^,,^ is an integral curve of P. 

Conversely, let A be an integral curve of P and consider the geodesic 
7A(to) with 7A(to)(^o) == A(to)- By the construction above, ^x(to) is an 
integral curve of P. Since its initial point in TM is A(^o), it must coin- 
cide with A(to) by the uniqueness part of the fundamental theorem for 
ordinary differential equations (Theorem 2.4.1). I 

We have seen that a manifold is locally isomorphic to (considered as a 
set with differentiable structure). It is not true, however, that a manifold 
with connection is isomorphic to with its affine structure. In fact, for 
a manifold with connection there do not generally exist charts which 
map geodesics into straight lines. For this to be the case a necessary 
condition would be that the connection is in the same projective class 
as the canonical connection of A^. We will now show that one can still 
identify the geodesics which pass through a given point with all straight 
lines in A’^ which pass through a given intersection point (cf. Lemma 
2.6.4). 
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As a consequence of Proposition 2.6.4 and the fundamental theorem 
for ordinary differential equations, each x ^ M has a neighbourhood 
W of 0 in Ta;M and there is a 5 >0 such that for all u G W the 
geodesic 7 ^,: {—5^5) — > M with 7 ( 0 ) = v is defined. By choosing >V 
small enough we can normalise the interval [-<5, 5]. In fact, observe that 
iav{t) ~ ajy{at) for any a G K. Hence for every x e M the zero vector 
0 G TxM has a neighbourhood U C T^M such that for dll v e U the 
geodesics t f-> with initial velocity v is defined on the interval 

[- 1 , 1 ]- 

Definition 2.6.4. Let (M, V) be a manifold with connection. The map 



exp: {u G TM : 7 i;(l) is defined} M 

V exp(w) exp^(t;) := 7 „( 1 ), 

where x = TtTMiv), is called the exponential map 0 / V. 

It follows from the fundamental theorem for ordinary differential equa- 
tions that the set of all u G TM, for which the integral curve A of T 
with A( 0 ) = is defined up to (including) parameter value 1 , is open. 
Hence the domain of exp is open in TM. This also implies that for any 
X the intersection of this domain with T^^M is open in T^cM. This set is 
also star-shaped as a consequence of the equation jav{i) = a%{at) for 
all a G M. 

Proposition 2.6.5. For each point x e M there exists a neighbour- 
hood U o/O G TjcM such that exp^, is a diffeomorphism from U onto a 
neighbourhood U of x e M. 

Proof. Let v € T*M and v = € TqT^M. Then 

Texp,(«) = ' ” 

which implies that T exp^, is an isomorphism. Now the assertion follows 
from the inverse function theorem. I 



Corollary 2.6.4. There is a neighbourhood tj o/Oa, G TM such that 
Exp: U M X M, Vy ^ {y^exp{vy)) 
is a diffeomorphism onto its image. 

Proof. The corollary follows from the fact that TExp = Ttttm ® Texp 
is non-singular whenever Texp is non-singular. I 
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Proposition 2.6.5 provides especially practical coordinates. Let {ei, . . . , 
Sn} be a basis of T^M a nd write = v'^Ci for each vector Vx € TxM. 
Then Br{0x) = {vx : < r} is a neighbourhood of Oaj e 

TxM. The map exp^,: Br{0) ”> Br{x) := exp{Br{0x)) is a diffeomor- 
phism for any sufficiently small r > 0. Hence we can define a coordinate 
system by x^{y) := (exp^~^{y)) . These coordinates are called normal 
coordinates and the corresponding chart is called a normal chart. 

Lemma 2.6.4. Let (M, V) be a manifold with connection, x G M, and 
{U,(p) he a normal coordinate chart centered at x. Then maps the 
geodesics through x onto the straight lines through 0 G K^. 

Furthermore, the Christoff el symbols with respect to the chart {U, p) 
satisfy = 0. 

Proof. The first assertion follows immediately from the construction of 
normal coordinates. 

To prove the second assertion we must only show that F^^v^v^ van- 
ishes for all vectors v G TxM. Let 7 be the geodesic through x with 
^(0) =: V. Since its coordinate expression is a straight line the coordinate 
components satisfy 7^ = 0 and therefore 0 = l^{t)-^F^^{j{t))'y^{t)Y{t) = 
At t = 0 this equation reduces to F^J^x)v^v^ = 0. ■ 

In Euclidean space, a convex U set is characterised by the requirement 
that any two points x,y ^ U can be joined by a straight line which is 
contained in U. For a manifold with connection we call a set U convex 
if any two points x,y can be joined by a unique geodesic which is 
contained in lA. We will now show that each point has a convex neigh- 
bourhood. 

Theorem 2.6.2. For each x e M there is a sequence of convex neigh- 
bourhoods Un with = {x}. 

Proof. Let {x^,...,x‘^) be a normal coordinate system and {U,(p) the 
corresponding chart. For y G Image (exp^) nU we define the distance 

function d{y) := {x^{y))^. Let Br{x) = {y e Image(exp3,) : d{y) < 

Choosing r small enough, there exists a neighbourhood W (r) of Og; G 
TM such that Exp maps W{r) diffeomorphically onto Br{x) x Br{x). 
For f — > 0 the neighbourhood W{r) shrinks to the set {0^:} C TxM. 
Because of the continuity of exp and exp(0a;) = x, there is an f G (0,r) 
such that exp{twy) G Bx{r) for all t G [0, 1] and Wy G W{r). 

We will show that (for f sufficiently small), any two points y,z € 
Br{x) are joined by a geodesic which does not leave Br{x). Let 7 be the 
geodesic starting at y with velocity vector Exp"^ (y,z). This geodesic 
joins y with z by the definition of Exp. 
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We have to show that for f small enough the curve 7 cannot leave 
The idea of the proof is as follows. Since this geodesic would have 
to re-enter Br{x), in the given coordinates it would have to be curved at 
least of order 1/f somewhere outside Br{x). On the other hand, geodesics 
are generalisations of straight lines. This should give a contradiction for 
f small enough. While this is the geometrical idea of proof, it may not be 
entirely clear from the anal3rtical implementation which we will present 
now. 

If 7 would leave Bf{x), then the map t d{j{t)) would have a 
maximum which is bigger than f . At this maximum we have o 7 = 0 

and o 7 < 0. We can directly compute 

•i%.. E;.i ((■>*)" + T°y) 

*2 d(y(t)) d(j(f)) 

where in the last equality we have used the geodesic equation. Since 

= 76c)(^)t“ 7'’ and = 0, 

we can choose the original r so small that Sab ~ ^^(y)^(bc)(y) positive 
definite for all y e Br{x). The curve 7 does not leave Bx{r) by the 
constructions of f. Hence we must have o 7 > 0 at the maximum 
which gives a contradiction. | 
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In this section, we will introduce two examples of connections which 
will both become important in Chap. 3. 



2.7.1 The Levi-Civita connection 

The following definition is a generalisation of Euclidean space 
(A^, (•, and Minkowski space (A^,r/) to real manifolds which are 
not necessarily affine spaces. 

Definition 2.7.1. A pseudo- Riemannian manifold (M^g) is a real man- 
ifold M together with a symmetric -tensor field g which is everywhere 

non-degenerate. We will often simply write (u,v) instead of g{u^v). 

The norm of a vector u with respect to g is defined by ||u|| = -\/[g(n,u)|, 
but in general it is not a norm in the sense of linear algebra (the triangle 
inequality only holds if g is positive or negative definite). Let i,j,v e 
{1 , . . . ,n} and define 
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— 1 if i = j < zy, 
^ 1 if i = j > i/^ 

0 otherwise. 

\ 



If (M, g) is a pseudo- Riemannian manifold then there is a i/ € {1, . . . , n} 
such that for each point x e M there is a basis {ei, . . . , Cn} of T^M 
which satisfies 

g{ei,ej) = {'qu)ij- (2.7.6) 

y times {n~y) times 

We say that g has signature +~??7+ ) and call u the in- 

dex of g. A basis {ei,...,en} satisfying Equation (2.7.6) is called an 
orthonormal basis and a local frame {Ex , . . .E?n} such that for each x 
in its domain of definition {Ei{x), ,..En{x)} is an orthonormal basis is 
called an orthonormal frame. 



Definition 2.7.2. A pseudo- Riemannian manifold is called a Rieman- 
nian manifold if g is positive definite and is called a Lorentzian manifold 
if g has signature (— , . . . , 

Lemma 2.7.1. Let (M, p) be a pseudo- Riemannian manifold. Then 
each X e M has a neighbourhood U which is the domain of an orthonor- 
mal frame. 



Proof, Let W be a coordinate neighbourhood of x and denote the in- 
duced GauBian frame by {5i, . . . 5n}- We apply a variant of the Schmidt 
orthogonalisation procedure to this frame. There exist functions on 
hi such that Ui{x) := satisfies f/i) ^ 0 at all points 

x e hi. We will use an induction argument in order to define a frame 
{El,, . . ,En} which is orthonormal up to a permutation of the frame 
vector fields. Let Ei — ?7i/||?7i||. Now assume that we have constructed 
pointwise linearly independent vector fields {Ei, . . . , Ei-i} such that 
g{Ek, El) — 0 for k ^ I and g{Ek, Ek) = =bl. There exist functions on 
U such that Ui{x) := Aj(x)dj does not lie in span{Ei, . . . , Ei^i} 
and 



tJi 






Ui-E 



k=i 



gjUj^Ek) 

g {^k > Pk ) 



satisfies g{Ui, UJ) ^ 0. The vector field Ei = Ui/\\Ui\\ is well defined and 
satisfies g{Ei, Ei) = ±1, g{Eu, Ei) ^ 0 for fc 6 {1, . . . , i ~ 1}. Finally, let 
{El, . . En} be a suitable permutation of {^i, . . , , En}. I 



Minkowski spacetime has a metric rj which is constant with respect to 
the usual derivative D, i.e. Dp - 0. This is equivalent to the fact that 
parallel transport of vectors is an isometry. Furthermore, D is the only 
torsion-free connection which satisfies this requirement. For a general 
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pseudo- Riemannian manifold, there exists a unique connection for which 
an analogous statement holds. 

Theorem 2.7.1. Let (M^g) be a pseudo- Riemannian real manifold. 
Then there exists a unique torsion-free connection V which satisfies 
Vg = 0 . 

This connection is called the Levi-Civita connection. 

Proof of Theorem 2.7.1. Recall that the connection V is torsion-free 
if and only if Vf/V ~ ^yU = [[/, V] for all vector fields U, V. A short 
calculation shows that the condition Vg = 0 is equivalent to U (V, W) = 
(V^y, W) + (^5 The strategy for the proof is to use these two 

conditions in order to calculate the only possible candidate for the Levi- 
Civita connection. It is then easy to verify that this candidate satisfies 
all the relevant equations. 

To exploit that V is assumed to be torsion- free consider the difference 

u (y, w)~w ([/, V) - ( v^y, w) + (y, VuW) 

- {VjjV, w) - {u, v^v) + ([[/, V) . 

If we add V {U, W) = {VyU^ W) -h (U, VyW) to this equation the right 
hand side becomes (V^/y, W) -f {VyU^ W) + {U, [V, W]) + ([[/, W], V). 
Here we can eliminate (VyU^W) using that V is assumed to be torsion- 
free: (yyU^W) = {[V,U]^W)-\-(yjjV^W). Putting everything together 
we finally obtain the Koszul Formula 

{VjjV, W)^]^(u (y, W) + V {U, W)-W {u, v) - {u, [y, w\) 

+ {V,[W,U])-{W,[V,U])). ( 2 . 7 . 7 ) 

I 

The following proposition gives a slightly more geometric characterisa- 
tion of the Levi-Civita connection. 

Proposition 2.7.1. Let (M^g) be a pseudo- Riemannian manifold and 
V be a torsion-free connection. Then \/g = 0 if and only if parallel 
transport is an isometry. 

Proof Assume first that V is the Levi-Civita connection. Let t j{t) be 
a curve and t U{t)^t ^ V{t) he two parallel vector fields along 7 . Then 

we calculate ^(t/(i),y(i)) = (C/(t),y(t)) = + 

{m, V 7 (t)y) ~ ® which implies that (J7, V) is independent of t. Con- 
sequently, parallel transport is an isometry. 
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Conversely, assume that parallel transport is an isometry and let 
be vectors. We choose a curve 7 with 7(0) = w and parallelly 
transported vector fields ?7, V with U(0) = u, 1/(0) = v. Then we obtain 
0 = {U, V) = = 

{^j(t)9)(Uy V). Att^O this implies 0 = which proves the 

assertion. | 



2*7.2 The Weyl connection 

Let M be a real manifold and ^ be a metric on M. A second metric g 
is conformal to g is there is a positive function f?: M — > R+ \ {0} with 
g — n’^g. A conformal structure C is an equivalence class of conformal 
metrics. In the next chapter we will see that the Michelson Morley exper- 
iment directly leads to a conformal structure rather than a Lorentzian 
metric. 

Given a conformal structure € there is a class of adapted connections. 
This generalises the Levi-Civita connection of the previous section. 

Definition 2.7.3. A triple (M, where M is a n- dimensional man- 
ifold, C a conformal structure on M, and V a torsion-free connection is 
called a Weyl structure if for every g G C there exists a one- form ip such 
that Vg — p0 g. The connection V is called a Weyl connection 

In the following we will use the exterior derivative da? of a p-form 
oj e Ti(M) (cf. Theorem 2.5.1). In Theorem 2.7.2 below we will also 
use the lemma of Poincare (Theorem 2.5.5). 

Readers who have omitted Sect. 2.5 can replace duab by 2\d[a(^^, 
Using this equality the lemma of Poincare can be understood in our 
special case. 

Lemma 2.7.2. Let (M, C, V) he a Weyl structure. Then the 2-form F — 
— \dp is independent of g G 

Proof Let g G € and g = Q’^g, Then we have 

Vg “ 2i?di? 0^4- 12^ Vp = {(p + 2dln f2) 0 g — (p <S> g^ 

Hence F — ~^dp — -^d<p does not depend on the choice of g. I 

The 2-form F is called the length curvature of the Weyl structure. We 
will motivate this term in Sect. 2.8.1 below. 

Theorem 2.7.2. Let (M, V) be a manifold with Weyl structure and 
X G M, Then x has a neighbourhood U such that for the induced Weyl 
structure {U, V) the following statements are equivalent 

(i) F = 0, 
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(a) There exists a g e € which has Levi-Civita connection V. 

Proof. We first show that (ii) implies (i). Let ^ G ^ be any metric and 
g = Q^^g such that Vp — 0. Then we have Vg = (p 0 g = — 

2f2d^2 ^g~ f2^dO (g) g. Hence cp = and dtp = 0. 

, For “(i) => (ii)” note that F — |d<^ = 0. Hence an application of the 
lemma of Poincare (Theorem 2.5.2) implies the existence of a neighbour- 
hood W of a: and of a function f:U~^R with df = ip. Consequently, 
V(e~*^^) = -e“*^d/ ® g d- e~^df <g ^ = 0. I 

Corollary 2.7.1. Let (M, (2^, V) be a Weyl structure and assume that 
there exists a parallel, non-vanishing n-form fi. Then there is an (up to 
sign) unique metric g e (C such that V is the Levi-Civita connection of 
g and |i?(Fi, . . . , En)\ = 1 for every g-orthonormal basis {Ei , . , . , F^}. 

Proof. For any metric g e (C we define an n-form fl as follows. We let 
{El, . . . ,En} be an orthonormal basis with dual basis {9^,. .. ,9^} and 
denote g{Ei, Ei) e {-1, 1} by e^. Then ft is defined by p = 9^ A - ‘ A9‘^. 

Since yl^(T^M) is 1-dimensional there is a unique g e € such that 
Jl~ ji. For this metric and any vector v we calculate 

0 = (^(Fi, . . . , F^)) 

n 

” (V^/i) (Fi, . . . , En) 4- P'iEi, . . . , Fi_i, V^Fi, F^-|_i, . . . , F„.) 

i=l 

n 

~ ^ ^ /^(Fi 5 . . . j F^_ I , gf^yE-i , , Ei)CiEi , Ej^j^i, . . . , E^f) 
i=l 
n 

= Y^gC^vEi,Ei)ei 



= E h {9{Ei, Ei)) - (V„g) {Eu Ei)) 

i=l 



n 1 ^ 

= - E ^^M^)9(Ei, Ei) = -^<p{v). 






Definition 2.7.4. Let (M, £, V) be a Weyl structure and j be a smooth 
timelike (or spacelike) curve. We call 7 affinely parameterised if 

for all t in the domain of definition of j. 
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It is clear that timelike pregeodesics are affinely parameterised if and 
only if they are geodesics. 

Lemma 2 . 7 . 3 . Let (M, C, V) be a Weyl structure and 1 7 6 e a smooth 
curve with g (jit), jit)) ^ 0 for all t and all g e Then there exists a 
reparameterisation 7 ( 5 ) = 7 (^( 5 )) such that 7 is ajfinely parameterised. 
//t 7 (t) is affinely parameterised then s 7 (^( 5 )) is affinely param- 

eterised if and only if there exist a, 6 G M such that s = at + 6. 

Proof. We denote ^ by a dot, (•), and ^ by a prime, (•)'. Let 7 ( 5 ) = 
7 (t(s)). Then 



= 5(V^(s(i))^(s(i))>4(s(t))) 








implies that 7 is affinely parameterised if and only if 

d^s ds '7(^)) 

dt 2 dt g{^{t),j{t)) 



holds. The first assertion follows immediately from the fundamental the- 
orem for ODES. If 1 1 -^ 7 (t) is already affinely parameterised, the differ- 
ential equation reduces to ^ = 0 and the second assertion follows. I 



[p. 132 X] 
^8 



X P, 151 



2.8 Curvature 



In Sect. 2.6 we have seen that the covariant derivative defines a notion 
of parallel transport along curves. Given a small loop 7 : [ 0 , 1 ] 1 -^ M with 
^( 0 ) = 7 ( 1 ) = a;, this parallel transport defines a map 



p. 179 X 
[X p. 141] 



: T:,M Ta:M, Vx ^ 

While in Minkowski spacetime and in Euclidean space we always have 
P^Vx — Vx, in general the vector P^Vx depends on the loop 7 . The 
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Fig. 2.8.1. The immersed surface 
in Theorem 2.8.1 



following theorem shows that if one restricts to rectangular loops then 
there exists a well defined limit where 7 {x}. 

Theorem 2.8.1. Let (M, V) be a manifold with connection andx e M, 
UjVjW € T^M. Then there exists a well defined tensor field 

R: Ta,M X T^M x T^M ^ T^M 

(u^v^w) R{u^v)w 

with the following property. 

Let U be an open neighbourhood of (0, 0) and f:U-^M be an 
immersed 2-surface such that f^di ~ u, f^d 2 = v. For any a > 0^ b > 0 
with [0, a] X [0,6] CU let ja,b be the closed curve 

7a, 6: [0,2a-f26]--./(6^)cM 





for 


0 <t < a 


f{a,t - a) 


for 


a <t < a-\-b 


/(2a + b-t,b) 


for 


a + 6<t<2a + 6 


J{0,2a + 2b-t) 


for 


0 < t < a. 



Then ^ 

lim ~ (P m — tu ) = —R(u.v)w 
(a,6)-4(0,0) a6 V Tab J ^ ^ 

holds. 

If [/, V, W are vector fields with — UjV^ = v, and = w then 
R{u^v)w is given by 

R{UjV)w = (^ij'VyW ~ VyVjjW — ^ • 

® The following section will be of great importance in Chap. 5. However, at 
this point in time it is better first to skip forward. 
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Proof. It is easy to check that 

R(u,v)w= - VyVyW - 

does not depend on the extensions U, V, W and is therefore a tensor field. 

We denote the four segments of the curve ja,b by 7a;,,.. -7a re- 
spectively. Let W{a, b) = P...2 o P i (ui) and observe that W depends 

ia,b la^b 

smoothly on (a, b). Prom the definition of parallel transport we have 
This and VP(0, 0) = immediately imply 



R(u,v)w = - 



( 0 , 0 ) 



Observe that the curve segment 7^ ^ is independent of a. In particular, 
T'f b ~ 7o,b which implies ^ w — W (0, b). Hence we obtain 






= -7 ( P ^4 o P 3 o P 2 o P i (w) ~ W 
ab V /a , 6 la^b^ 



4 fp^4 o (p 3 o fp 2 oP 1 (u;)V(P^4 )-l'j 
Clb Y '^>b y idyb y la^b ^a,b J J 



(w) 






P 3 (Wia,b))-W{0,b) 

> a,o 



If b = 0 then W is simply the parallel transport of w along 7a ^ which 
implies 

^/*5i^(®> 0) = '^7! («> 0) = 0, (2.8.8) 

and therefore 

^ i - v_f.a,iy(o,o)) 

(2-S-9) 

Since 7^ (, does not depend on a the parallel transport along 7^ P 4 , 

’ ‘ dfb 

does not depend on a either. By Proposition 2.6.2 the limit of the first 
summand is therefore given by 
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= .“ST 

In order to complete the proof we need only to show that the second 
summand in Equation (2.8.9), 

<p(a,b) = lp^4^(l(P^3^(W^(a,6)) - W(0,6)) - 

has the limit 0 for (a, b) 0, 

We will first show that this summand is continuous in (a, 6). Since 

<p(a,b) = P^4^ ((P^3^(iy(a,6)) - W(0,b)) - aV f^Q^W(0,b)^ 

depends smoothly on (a, 6), an application of the Taylor formula yields 
b) = 'ip(a) + 0(b) + abif{a, 6), 

where are smooth functions. The curve 7^ ^, is constant which 

implies P7^^ = id and therefore ^p{0^b) = 0 for all b. The curve jIq 

is inverse to the curve 7 ^ q. In analogy to Equation (2.8.8) we obtain 
V 0) — 0 and therefore 

^:?(a, 0) = P^3 , W(a, 0) - W(0, 0) - aV W( 0, 0) 

= P^3^W(a,0)™W(0,0)=0. 

Prom ip{0fb) — (p{dy0) = 0 we conclude that both ijj and 9 are con- 
stant and that their sum vanishes. Hence we have <^(a, b) ~ ah^{a^ h) — 
a6c^(a, h) and 

<P{a,b) = 

is smooth. In particular, (p is continuous. The equation (^(0, 0) = 0 follows 
now from 



lim - 

<X — >0 Q, 




(W"(a,6 ))-W( 0 , 6 ) 



for any 6 > 0 . 



^f.d^W{0,b), 
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Definition 2»8.1* The {^-tensor R is called the curvature tensor (or 
Riemann tensor j of (M, V). 

Lemma 2.8.1 (Bianchi identities). Let (M, V) be a manifold with 
connection. Then R satisfies the second Bianchi identity^ 

(Vyi?) (v,w) + (V^i?) (w,u) + (V^i?) (u,u) = 0. 

If V is torsion-free^ then R also satisfies the first Bianchi identity^ 

R{u^ v)w + R{v, w)u 4- R{w, u)v = 0. 

Proof, Consider a normal coordinate system centered at 

X e M and let U^V^W be extensions of u^v^w € T^M such that these 
vector fields have constant components with respect to our coordinate 
system. We have then [U, V]y ~ [27, W]y — \V^W]y Q for all y in this 
coordinate neighbourhood and Vjj V = Vy W = Vpy U = 0 ^;. For any 
vector field X we get 

(V[/^i?) (^, V = {R{V, W)X) - R{XuV, 

- R{V, VuW)X - R{V, W)^jjX 
= (J?(y, ty)x) - R{V, w)VjjX 

= 

-([V;7,[Vy,V^]),X 

The second Bianchi identity follows now immediately from Lemma 2.4.3, 
For the first Bianchi identify we assume in addition that V is torsion 
free. We calculate 

i?(u, v)w + R(u, w)u + R{w^ u)v 

” VfyVyW - VyVf/W + VyVyr?7 — Xy/VyU 
+ ~ Vf/Vpj/F 

- Vt/ (Vy W - VpyV) + Vy (VyrC/ - V^W) 
•fV|y (V^y-VyC/) 

= 04-0 + 0 = 0 



Definition 2.8.2. Let (M, V) be a manifold with connection. The ten- 
sor Ric(n,u) = tr(R(',ii)u) is called the Ricci tensor. We denote by 
F € 0'^{M) the 2 /n- fold multiple of the antisymmetric part of Ric, 



F{u^ u) = — (Ric(u, v) — Ric(u, u)) . 
n 



[p. 137 i] 
J. p. 184 
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Lemma 2.8.2. Let (M, V) be a manifold with torsion-free connection. 
Then F — •)) holds. 

Proof Let {Ei , . . . , E^} be a basis, {lo^^ . . . be dual basis and u,v e 
TxM, Prona the first Bianchi identity and the antisymmetry of •) we 
obtain R{u,v)Ea = ~R(Ea,u)v + R{Ea,v)u and therefore 

nF{u,v) = {R{Ea,u)v - R{Ea,v)u) 

= {R{u,v)Ea) - -tv(R{u,v)), 



2.8.1 Applications to Weyl structures 

The differential form F has a particularly geometric interpretation if V 
is a Weyl connection, justifying the following definition 

Definition 2.8.3. Let (M, £, V) be a manifold with Weyl structure. 
Then we call F the length curvature and K, defined by R{u^v)w = 
K{u, v)w + F{u^ v)w the directional curvature. 

To motivate these terms coined by WeyP we will need the following 
lemma. 

Lemma 2.8.3. Let (M, V) be a manifold with Weyl structure. Then 
F == \dip and g{K{u,v)w,w) = 0 for all vectors u,v^w and all g e€.. 

Proof. Let C/, V, W be tensor fields with Ux = u, 14 = and W4 = w. 
We can also assume that [L", V] = 0. 

^(V[/VyW,W) 

= - w) - 5(Vy W, VjjW) + V[/ {giVyW, T^)) 

= -ip{U)g{VyW, W) - giVyW, VjjW) + Vu(^V* g(W, W) 

-li^V9KW,W)) 

= -^{U)[\v*g{W,W) - ^(Vvg)iW,W)^ -9(VyW",V[/W) 
+ \u*V. g{W, W) - ^V^(<p(V)g(W, W)) 

= {<P(U)V . g{W, W) + <p{V)U . g{W, T^)) 

^ In the German original, they are called Streckenkriimmung and Richtungs- 
krilmmung. 
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+ ^^{UMV)g(W, W) - giVyW, Vf/W) 

+ ^U.V. g{W, W) - l{Vu<p)iV)g{W, W) 

- l<P{^uV)9(W,W). 

This implies (using [17, V] = 0) 

g{R{U, V)W, W) = g{Vu^v^ - Vy VjyW^, W) 

- ~l 9{W, W^) 

= -^d<p{U,V)g{W,W). 

Since W is arbitrary, we obtain tr(i?({7, V)) = ~~^dip{U^ V) which proves 
the first claim. The second claim follows from 

g{K{U, V)W, W) = g{R{U, V)W, W) - g{F{U, V)W, W) 

= -^d<p{U,V)g{W,W)-g{-\d<p{U,V)W,W)^0. 



Let w e TxM and 7 be a small loop of the type given in Theorem 2.8.1. 
Then the parallel transport is approximately 

PjW ^ w -\- abR{u^ v)w 

= w + abF{u^ v)w + abK{u^ v)w . 

V " 

It follows that 1 + afojF(u, v) is the factor by which the parallel transport 
stretches the (relative) length of w and that abK{u, v)w is the change of 
direction of w due to the parallel transport. Since the parallel transport 
of a vector does not leave its relative length invariant it is impossible to 
compare lengths at different points. There is an important consequence 
to this fact, the so-called clock paradox of second kind (cf. Sect. 3.3). 



2.9 Variation of geodesics 

In this section we investigate the infinitesimal analogue of 1 -parameter 
families of geodesics. 

This section is technical and can be omitted on first reading. It 
is a prerequisitive for Sects, CL'O'd in the chapter on pseudo- 
Riemannian manifolds. Section 4-5 is concerned with metric preserv- 
ing diffeomorphisms and used in the discussion of cosmological models 
(Chap. 6). Sect. 4-d is necessary for understanding the complete proof s 
of the singularity theorems which are presented in Chap. 9. 
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In Chap. 5 we will see that freely falling particles can mathematically 
be described by (certain) geodesics. A cloud of such particles moving in 
spacetime corresponds then to a smooth 3-parameter family of geodesics. 
It is therefore of physical interest to study families of geodesics. Here we 
will study the slightly simpler sub-case of 1-parameter families and its 
infinitesimal analogue. 

Let /; i7 -4 M be a differentiable map and recall that a (smooth) 
vector field along / is a smooth map X: E ^ TM with tttm oX{x) = 
f{x) for all X £ E, We denote the space of vector fields along / by T^{f). 
Any smooth vector field X on M induces a vector field X: x X^ :~ 
Xf(^x)' A vector field U on E also induces a natural vector field along 
/ via a: Important examples of this construction are given by 

vector fields along curves and by vector fields along canonical immersions 
of submanifolds (cf. Sect. 4,4). 

Lemma 2.9.1. Let f: E ^ M, U,V 6 T^{E), and X,Y € 

Then V jjX + rjg.(/^C/)^X^) 5^ is a well defined vector 

field along f and satisfies the following properties, 

(i) jjX is function-linear in U and M-linear in X ; 

f f 

(a) V ~ d(p{U)X + (fX jjX for all functions (p e C^{E); 

(in) V V V] = Tor(/,17, f.V), 

Proof We have to show that the definition is invariant under coordinate 
transformations. Let ^ be a diffeomorphism of E and 0 be a diffeomor- 
phism of M. Then we obtain 

= ({u^dpX’^ + rsMfu^)X'^)da)o^-\ 

Hence the formula does not depend on the coordinates chosen for E. 
Let (^ : M M be a diffeomeorphism. We obtain 

+ n,{d 0 {<j> o ffu^)(ddr) o fx^)d„ 

= (u>^{ideddr) ofd^rx^ + {d^r) ofa^x-^) 
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+ (9e/) ofdpru^id^r) ofx^)da 

= o / + {{d,dd 4 >-) o / 

+ (0e/) o / o f)d 0 rX<^) da 

The Gaufiian basis vector field with respect to the coordinates induced by 
(f) are given by Taking this into account we see that the Christoffel 
symbols transform as given in Corollary 2.6.1. This implies that our 
coordinate formula defines a well defined vector field along /. Equations 
(i)-(iii) follow directly from our coordinate expression. I 

If / is an immersion then the covariant derivative along / can be calcu- 
lated entirely in M. 

Lemma 2.9.2. Let f : E M be an immersion, U,V e Tq{E), and 
X,Y € Toif), Let U,X be vector fields on M which coincide with fM 

and X at all points y = f{x). Then we have V ijX ~ (XfjX'^ o f at all 
xeE, 

This lemmajustifi.es writing V f^jj^ instead of {XfjX^ of. We will use 
this notation extensively in Sect. 4.4. 

Proof of Lemma 2.9.2. Let x ^ E. Since / is an immersion there exists 
a neighbourhood of W of a:, a neighbourhood V C of 0, and a 

local diffeomorphism F:U xV ^ M with F{x, 0) = f{x) for all x 
We may extend U, X toU xV such that tj ~ F^U and X ~ X o F~^. 
Then we obtain 

{xfjxf O F = U^daX’^ oF- o F 

= diF^Wda{X^ o F"^) o F - {Fla o F) {F^U^X^ 

= diF^^U^djX^iF-y o F - {F^ o F) (F*F)“X" 

= WdjX‘’ - {Fla o F) {F^UfX'^. 

Restricting the last expression to W x {0} gives 

(VyX)oF(rr,0) = (v 

for qL\.x C E. I 

Lemma 2.9.3. Let f \ E M , U,V e T^{E), and X € T^{f). Then 

the equation 

R{f*U, f^V)X =V jj V v^~ V y V jj^~ ^ [C/, V]^ 



holds. 
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Proof, The equation follows directly from the definition of the Riemann 
tensor, I 

Observe that / does not need to be an immersion and therefore /(i7) 
may not be an immersed submanifold. This is important for the following 
application. 

Definition 2.9.1. A geodesic variation is a map f: (-(5,5) x (a, 6) 

M, (Syt) f{s,t) e M such that for each s the curve t f(s,t) is a 
geodesic. We denote the velocity of the geodesics by ft := T(^s^t)f{dt) and 
the deviation vector field along the geodesic f{s^ •) by fs := T{s,t)f{ds) ■ 

Proposition 2.9.1. Let f : (s,t) /(<s,t) £ M be a geodesic variation. 

f 

Then ft satisfies the geodesic equation v = 0 and the deviation 
vector field fs satisfies the equation 

V V d^fs + Rifs Jt) ft - (V a Tor)(/i/,) - Tor(/t, V a,/«) = 0- 

Proof. The geodesic equation follows directly from the definition of a 
geodesic variation. Observe that [Z^,/*] = [Mds)J*(dt)] = = 

0 and that therefore 

=0 

V aZ djs =v a,v ojt+ v a*L^+ v a, (Tor(/t, /,)) 

=0 

= Rift, fs)ft+ V a,V dtft + (V atTor)(/t, fs) 

+ Toiift,i djs)- 



It is often sufficient to consider the infinitesimal analogue of geodesic 
variations. This justifies the following definition. 

Definition 2.9.2. A Jacobi field is a vector field J along a geodesic 7 
which satisfies the Jacobi equation 

J + R( J, 7)7 - (V^Tor)( 7 , J) - Tor( 7 , V^J) - 0. 

Proposition 2.9.2. Let 7 : [a, fe) M fee a geodesic. The Jacobi fields 
along 7 span a 2n- dimensional linear space and any Jacobi field J along 
7 is uniquely determined by J(a), 

Proof. Without loss of generality we may consider a single chart which 
contains the geodesic. The Jacobi equation reduces then to a system of 
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n ordinary second order differential equations, or, equivalently, to a sys- 
tem of 2 n ordinary first order differential equations. Hence the assertion 
follows from the fundamental theorem for ordinary differential equations 
(cf. Theorem 2.4.1). I 



Corollary 2.9.1. Let 7 ; [a, 5] M, 1 1 — > exp^^. {{t — a)ux) be a geodesic. 

A vector field J along ^ is a Jacobi field which vanishes atx — 7 (a) if and 
only if there is a vector Vx gTxM with J = exp {{t — a) {ux sVx))^ dg. 

Proof. It is clear that, given such a vector J is a Jacobi field along 
7 which vanishes at x. Proposition 2.9,2 implies that the Jacobi fields 
along 7 which vanish at x span an n-dimensional vector space and are 
characterised by their velocity vector J. The assertion follows since 

exp ((t - a)(ua: + svx))* == 

for all Vx ^ TxM and TxM is an n-dimensional vector space. I 



Definition 2.9.3. Two points x^y e M are conjugate if there is a 
geodesic 7 joining x and y and a non- zero Jacobi field J along 7 which 
vanishes both at x and y. 

Proposition 2.9.3. Two points x^y e M are conjugate if and only if 
there exists an Ux £ TxM in the domain of exp such that exp(uo:) = y 
and Tu„, exp^ : Tu,,TxM ~^TyM fails to have maximal rank. 

Proof If X, y are conjugate, there is a geodesic 7 : [0, 1 ] — M joining these 
two points and a Jacobi field J along 7 which is non-zero but vanishes 
at X and y. Let Ux be the uniquely determined vector which satisfies 
exp{tUx) = j{t). By Corollary 2.9.1 there exists a vector Vx £ TxM \ {0} 
such that J{t) = T^o,t) ^y^p(t{ux + swx)){ds). The assumption J(l) = 0 
implies that the linear map T(o,i) exp(t(ua; -h K? — ^ TyM does 

not have maximal rank which in turn implies that T^,,, exp ^ : Tu.,,TxM 
TyM does not have maximal rank. 

To prove the converse assertion, we just choose the vectors Ux^Vx 
by the requirement that exp^{ux) = 0 and T^,^ exp = 

0. The vector field J{t) = T(o,^) exp(t(ua, + sWx)){ds) along j{t) = 
exp^{tUx) is then a non-zero Jacobi field which vanishes at x and y. 



Proposition 2.9.4. Let 7 : [a, 6 ] M be a geodesic without conjugate 
points. For every pair of vectors w^(^a) ^ T^(a)M, £ T^{p)M there 

is a unique Jacobi field J along 7 with J{a) — J{b) — 
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Proof, There are vectors e T^(a)M with 7(f) = exp^(^)((f-a)iA^(^)) 
and e Ty(6)M with 7(f) = exp^(^)((6 - t)w^(6)). Since 7 does not 
have conjugate points the linear maps 



'^(b-a)u^(a) ^^P7(a) ' ^(t“o)'U7(a)^7(ct)'^ ^ ^ 7 ( 6 )-^ 



and 



^(6— o)u^(i,) ®^P7(6) ‘ ^{h~a)u^(^iy)^'y{b)^ ^7(a)-^ 

are both isomorphisms. Hence there are vectors and such that 

'^7(f>) ^ ^(6-a)u^(„,) ®^P7(a) “ ^^ds js-0 *^^ 7 (^ 1 ))^ ’ 

'^7(0) ^(d-a)u^(f,) ^^P7(6) ~ ^^ds| -0 ^^7(6) 

Let Ji , J2 be the Jacobi fields defined by 

J\ = ^(o,t) exp((t - a){u^(o) + 5 u^(a)))(^s), 
h = T(o,t) exp((6 - i)(u^(6) + su^( 6 )))( 5 ^). 

Then Ji vanishes at 7(a) and has the value w^{b) at 7(6) whereas J2 
has the value at 7(a) and vanishes at 7(6). Observe that the sum 
Is well defined since tttm^ Ji(t) — 'KTM^J2{t) = 7(^). By the 
linearity of the Jacobi equation the vector field J = Ji + J2 along 7 is also 
a Jacobi field which has correct values at both 7(a) and 7(6). This proves 
existence. Uniqueness is clear since the space of solutions has dimension 
2n which is just the dimension of the vector space ©Ty(^)M. I 

Jacobi fields can also be used to calculate the differential of the exponen- 
tial map restricted to an n-dimensional ^Vertical” subspace of Tu.^,TM 
which consists of all V[y,] = +tUa;)|i=o ^ Tu,.TM^ where Vx € T^M, 

Proposition 2 . 9 . 5 , Let x e M, u^^Vx € TxM, and J be the Jacobi 
field along t 7(f) = ex-p{tUx) with J( 0 ) = 0 and V^J(O) = Vx, Then 
the differential of the exponential map in direction is given by 

Tu. exp3,(t;[„]) J(l). 

Proof, Let /: {sff) ^ exp^(f(u + su)) € M and 

ft — T(^s,t)f {9t) — ^t(u+sv) ®^Pa:(('^ ^'^)[t(u+sv)])> 

Since / is a geodesic variation with /(5,0) ~ x Vs the vector field 
(/s)|a=o Is a Jacobi vector field along 7 which vanishes at 0. Prom 
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IfsJt] = 0 we obtain =V gjs =V djt + Tor{fs,ft). At t = 0, 
we have /« = 0 and ft = Toexp^{{u + sa;)[o])), whence V^J(O) = 
+ «^)[o)) = ^oexp^((t;)[o]) = v. The Jacobi vector 
field J also satisfies J(l) = T(04)/(9s) = T(„) expj,(t)[„]). The asser- 
tion follows since initial value and derivative characterise Jacobi fields 
uniquely. I 




3. Space and time 

from a global point of view 



The content of this chapter is mainly physical In Sect S. 1 we show 
that the experiment of Michelson and Morley indicates that spacetime 
admits a conformal structure, A conformal structure is not sufficient 
to describe spacetime adequately. In Sect. 3.2 we generalise the no- 
tion of inertial observers which leads to the existence of a projective 
structure. ( One of Einstein’s key observations was that this projective 
structure is closely linked to the phenomenon of gravity. This will be 
pursued in Chap. 5). 

In Sect. 3.3 we use our physical postulates in order to show that 
the conformal and the projective structures of spacetime form a Weyl 
structure. Here we closely follow (Ehlers, Piranf and Schild 1972). 
The proofs in this section are technical and can be omitted without loss 
of continuity. In Sect. 2.8 we introduce a further physical postulate 
which restricts the Weyl structure to a Lorentzian manifold. 
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3.1 Light rays: the conformal structure 

In Sect. 1.4 we have seen that spacetime is endowed with an invariant 
field of light cones (cf. Postulate 1.4.1). In analogy to the discussion in 
the previous chapter we will define these light cones infinitesimally, i.e., 
in the tangent spaces rather than in spacetime itself. 

The discussion in Chap. 1 may seem to indicate that for each x e M 
the tangential space T^:M can be identified with {MT.tj). Since for each 
non-degenerate bilinear form of signature there exist 

linear coordinates (x®, . . . , x’^”^) such that = -d (x^)^-h(^ijdxMx^’ 

(i,j€{l,...,n-l}), one may be tempted to simply replace Minkowski 
spacetime (A^, rj) by a general Lorentzian manifold (M, g). However, the 
Michelson-Morley experiment only determines the paths of light rays.^ 
In other words, from the Michelson-Morley experiment alone one can 
only infer the existence of a conformal structure where 

{fPrj : Q € C^(A’^,E“^\{0})}. (In Sect. 1.4 we used the affine structure 

For the definition of wave length we needed the Euclidean structure of 
space. This seems to indicate that implicitly we used a Lorentzian metric 
rather than a conformal structure in order to interpret the Michelson-Morley 
experiment. However, the outcome of this experiment is a null-effect, i.e., 
AZ ^ 0 which is independent of the Euclidean structure chosen. 
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of to single out a constant representative rj e trj. This is not possible 
for general manifolds.) 

Hence in a global setting the Michelson-Morley experiment leads to 
the following postulate. 

Postulate 3.1.1 (Existence of a conformal structure). 

(Conformal) spacetime is a pair (M, (T), where M is a n-dimensional 
manifold and € a conformal structure of signature T). This 

conformal structure is given by the paths of light rays. 

We will call a conformal structure of signature Lorentzian. 

We will see that, given a Lorentzian conformal structure, it is possible 
to recover light rays (cf. Postulate 3.1.2 below and the discussion leading 
to it). 

Definition 3.1.1. Let (M, C) be a manifold with Lorentzian conformal 
structure. A null (or lightlike) hypersurface N is a hypersurface such 
that (for any g ^ €) the induced metric on N is positive semi-definite 
but not positive definite. 

Let (M, (t) be a manifold with a Lorentzian conformal structure and 
C M be a null hypersurface. At each point x £ N there exists a 
unique 1-dimensional subspace lx C TxM which is tangent to N and 
satisfies g{v,v) = 0 fox oil v e lx, g ^ For, if there where two such 
vectors Vi,V 2 6 TxN which were not collinear then there would exist 
a vector w e span{?;i,7;2} with g{w^w) < 0 for all w ^ But this is 
impossible since g restricted to TxN is positive semidefinite. If U, V are 
vector fields with Ux e lx \ {O^}, Vx e lx \ {0^} for all x € A" then the 
integral curves of V and U in N are rep arameterisat ions of each other. 

Definition 3.1.2. Let (M, C) be a manifold with Lorentzian conformal 
structure and N d M a null hypersurface. A curve j in N whose velocity 
vector satisfies 5^(7, 7) = 0 for every metric g e ^ is called a conformal 
null geodesic 

Lemma 3.1.1. Let ^ be a conformal null geodesic and g e €. Then 7 
satisfies the differential equation 

7“ + 9"'^ (dt,gdc - ^ 9 <i 56 c) 7V II 7 “- 

Proof. First we will show that the coordinate expression is invariant with 
respect to coordinate transformations and with respect to changes of the 
representative g £ €. If V is the Levi-Civita connection of g then we have 

(V^7)“ =r+ - \dd9b?j 77^ 
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whence we have only to show that for any other metric g G ^ with 
Levi-Civita connection V we have V^7 || V^7. But this follows from 

g-2/^arf ^dt,{e^^gdc) - gtc)^ = 5'“'^ (^dbgdc ~ ^ddgtc'^ i’i'' 

=0 

Let iV be a null hypersurface containing 7. We may chose coor- 
dinates for N such that di spans lx for each x. Since 

the bilinear form g\8pan{d2,...,dr,.-i} positive definite and g^xN does 
not have full rank, we have gu = OVi G {1, . . . , n - 1}. We may ex- 
tend the coordinate system to such that g{do,di) = -1 

at N. In these coordinates we have (after normalising 7) 7 — 5 i and 
we obtain g^^ {db9dc - ^ddgbc) Since in our co- 

ordinates we have 7 = 0, the lemma is proved once we have seen that 
9'''^ {-9d9n) II ( 5 i)“ or, equivalently, ddgn || Y9ad ^ This equa- 
tion follows from digu = 0 G {1, . . . , n — 1}. I 



Corollary 3 . 1 . 1 . Let v G TxM \ {0} be a vector with g(v, v) = 0. Then 
there exists an (up to reparameterisation) unique conformal null geodesic 
7 through x — 7(0) with 7(0) = v. 

Proof. By the fundamental theorem for ODES, given any function A(f), 
the differential equation 

7“ + 5'“'* (^dbgdc - ^ddgt^ - A7“ = 0 

has a unique solution for any v G TxM. It is easy to see that for any two 
functions A, A the solution curves are identical up to a reparameterisa- 
tion. I 

The preceding corollary shows that at each point there is a unique con- 
formal null geodesic in any direction Rv where g{v^ v) — 0 for all g e 
This implies that there are exactly as many conformal null geodesics 
as there are light rays. In addition, it is easy to see that in the case of 
the Lorentzian conformal structure induced by Minkowski spacetime the 
light rays defined in Sect. 1.4 and the conformal null geodesics coincide. 
Hence we feel justified to link our infinitesimal Lorentzian conformal 
structure to light rays in spacetime by identifying them with conformal 
null geodesics. 

Postulate 3 . 1.2 (Light rays). The light rays of spacetime are the con- 
formal null geodesics of its Lorentzian conformal structure. 
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A Lorentzian conformal structure is all we need in order to investigate 
causality. The following definition is a straight forward generalisation of 
Definition 1.4.6. 

Definition 3*1.3. Let (t be a Lorentzian conformal structure of signa- 
ture (-,4-,...,+), g and AM c M. 

(i) A vector w is called spacelike, ifg{w,w) > 0, timelike ifg{w,w) < 
0, and lightlike (or nullj if g{w^w) — 0. The vector w is called 
causal if it is timelike or lightlike. 

A vector field V is timelike ( respectively, lightlike or null, causal, 
spacelike^ if for each x e M the vector is timelike (respectively, 
lightlike or null, causal, spacelike). 

(ii) The Lorentzian conformal structure t is time orientable if there 
is a global timelike vector field V. 

Assume that € is time orientable. A time orientation is an equiva- 
lence class of timelike vector fields V where V ifg{Vx, W^) < 0 
at some point x e M. 

Let [V] be a time orientation of €, A causal vector u is called future 
directed (respectively, past directed j if g{u,V) < 0 (respectively, 
g{u, V) > 0). 

(in) A curve 7 is called spacelike (resp., timelike, lightlike, causal, 
future directed, past directed ) if all its velocity vectors 7 are space- 
like (resp., timelike, lightlike, causal, future directed, past directed). 
A timelike curve is often called a world line when one whishes to 
emphasise that it can represent the history of a (small) material 
object. 

(iv) The chronological future of a set A relative to U is 

M [A, U) = {x ^ M \ 3 a future directed, timelike curve 

'y CU from A to x} 



The causal future of a set A relative to U is 

M {A, U) ^ {x ^ M \ 3 a future directed, causal curve 

C.U from A to x} 

There are analogous definitions for the chronological past I~ (A, U) 
and the causal past J~{AM) of A relative toU. IfU — M we omit 
the term ''relative to and write I'^iA), etc. If A = {x} is a 
single point we write I~^{xM) 

Definition 3.1.3 is independent of the chosen representative g E d. 

Lemma 3.1.2. Let x e M andU be an open neighbourhood of x. Then 
M{xM) open. 
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Proof, Let y G I~^{x^U) and 7 C be a timelike curve from x to y. Choose 
a coordinate system . . , , let V C W be a (small enough) com- 

pact neighbourhood of y, and let ^ € V fl 7. Then the (coordinate) 
straight line i. from 0 to y is timelike. By compactness of V there exists 
an q: > 0 such that any straight line t from 2: with /(£,£') < a satisfies 
snp{g{P{t),t{t)) : i'{t) G V} < lsnp{g{l{t)^l{t)) : £{t) G V} < 0. 
Hence these lines are all timelike in V. Since they fill a whole neighbour- 
hood of y the assertion is proven, I 

The set J^{x^U) does not need to be closed relative to U. However, as 
we will see later, J"^(x, W) is closed if U is chosen small enough. 

Corollary 3*1.2. LetU be open and A be any subset ofU, 

I^{A,U) - I‘^{I+{A,U),U) = I^{J^{A,U),U) - J+{I^{A,U),U) 

C J+(AW) = J+(J+(^,W),W). 

Proof The only inclusions which are not obvious are J'^(J‘^(v4, W), W) C 
I^{A,U) and J+(J+(^,W),W) C I^{A,U). 

Let X G I'^{J~^{A,U)^U). Then there exist a> y e A and a G 
M, a causal curve /x: [0,1/2] ^ M from y to z and a timelike curve 
A: [1/2,1] ^ M from ^ to x. The concatenation 7: [0,1] — ^ M of /x 
and A is a (piecewise) causal curve from y to x which is timelike near x. 
Fix a metric get and let [/ be a timelike vector field along 7 which 
satisfies U{1) — 7(1). There is an e > 0 such that g{'y{t),'y{t)) < —e for 
all t G (1 — e, 1). Let 0: [0, 1] — > be a smooth function which satisfies 
(^(0) = (j){l) — 0 and ^{t) > 0 for all t G [0, 1 — e|. For any (small enough) 
s > 0 let f{sA) ~ exp^(^)(s</>(t)l^(t)). The curves 1 /(s,t) all connect 
y with X and /(0,t) = 7(;^). We denote derivatives with respect to t by 
a dot and with respect to s by a prime. We calculate 

{9ifis,t)J{s,t))y = Vf^Q^ig{f{s,t)J 

= 2g{VjfJ) = 25(V^(<^y),7) = 2MV,i). 

Hence we have (g(f(sA)^f(s,t)))[Q^t) < 0 for all t e [0, 1 - e] and the 
curves t^f{s,t) are timelike on [0, 1 — e] for s > 0 small enough. Since 
we have ^(7(^)5 7(t)) < — e < 0 for all t G (1 — e, 1] the curves also satisfy 
^(/(s,t),/(s,t)) < -e/2 < 0 for sufficiently small |s|. This proves that 
we have obtained a timelike curve t ^-4 /(so,t) from y to x. This curve 
may have a kink at the parameter value 1/2 where the original curve 7 
passes through Using Lemma 2.1,7 and a coordinate chart it is not 
difficult to smooth out t 1— > f{so,t) near t = 1/2 while preserving that 
this curve is timelike. 

The inclusion J'^{I'^{A,U)yU) C I'^{A^U) can be shown analo- 
gously. I 
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Lemma 3.1.3* J+(^) = int(J+(4)), J^{A) C 

Proof, By Lemma 3.1.2, I^{A) = is open. The inclusion 

I~^{A) C int(J+(^)) is clear. Let x € int(J+(^)). Since int(J+(^)) 
is open, there is an ?/ e I~{x) fl int(J+(.4)). Hence x e C 

I^{J^{A)):=^I^{A), 

Let X G J*^(v4), 7 be a causal curve from A to x, and W be a neigh- 
bourhood of X. Since any small enough deformation of 7 has future 
endpoint in ZY, we can deform 7 thereby obtaining a timelike curve from 
^ to ZY. I 

Definition 3.1.4. Let (M, be a Lorentzian conformal structure^ x e 
Mj andU be an open neighbourhood of x. We call 

Cx (^) = {y ^ M : 3 a future directed conformal null geodesic 

7 C Z/Z from x to y} 

the integrated future light cone of x relative to U, There are analogous 
definitions for the integrated past light cone and the integrated light 
cone. IfU~M, we omit the term ^‘relative to M” and write C~ , 
and Cx- 

Proposition 3.1.1. Let (M, (Z!) be a manifold with Lorentzian confor- 
mal structure. Then each x € M has an open neighbourhood U diffeo- 
morphic to such that 

(i) C^iU) \ {x} is a smooth hypersurface which is diffeomorphic to 
5^-2 X R, 

(ii) C+(Z^)n/ +(g:,f7)^ 0, 

(in) C+{U) C I+{x,U) = J+(x,ZY). 

Proposition 3.1.1 will be a corollary to Lemma 3 . 1.4 below which is 
a result from Lorentzian geometry. Choose a representative g G € and 
denote the associated Levi-Civita connection by V. The exponential map 
allows us to identify the causal structure of a convex neighbourhood of 
x £ M with the causal structure of an open, convex set of the tangent 
space at x. 

Lemma 3.1.4. Let (M, Cl) be Manifold with Lorentzian conformal struc- 
ture, g ^ and U be a convex neighbourhood of x ^ M with respect to 
the Levi-Civita connection of g. 

(i) y G I~^{x^U) (respectively J~^{x,U)) if and only if y = exp^(u) 
where v a future pointing timelike (resp,, caiisa^ vector, 

(ii) J+(x,ZY) =/+(cc,ZY), 

^ The complete proof of Proposition 3.1.1 requires the material from page 129 
immediately after the proof of Proposition 2.6.4 up to the end of Sect. 2.6. 
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Proof of Proposition 3,1 A, Choose any g € € and let u 6 TxM be a 
timelike vector with u) = —1. Each vector v € TxM can be uniquely 
decomposed as u = v^u + v±_ here v± € ~ {v e TxM : g{UjV) = 0} 

and € M. The bilinear form h{v,w) = g{v^w) + 2v^w^ is a Euclidean 
scalar product on TxM. For every e > 0 let Be{0x) — {v e TxM : 
h{v^v) < e}. This set is obviously a neighbourhood of 0^; in TxM. A 
similar argument as in the proof of Theorem 2.6.2 shows that = 
expo^,(j5e(0a;)) is a convex neighbourhood of x for small enough e. 

The set 5^ = {u e TxM : g{v,v) = 0,g{u,v) — is a 

submanifold of TxM which is diffeomorphic to the (n — 2)-dimensional 
round sphere = {z e = 1} which lies in the 

boundary of Be{ 0 x)^ The map (0, 1) x 5g — ^ M, {t,v) exp^(tu) is a 
parameterisation of C^{Uc) \ {x} which proves the first claim (i). 

Since the map exp^: Be{ 0 x) ^ is a diffeomorphism assertion (ii) 
follows from Lemma 3.1.4 and the fact that every causal vector w € 
Be{ 0 x)q is either timelike or null. Assertion (iii) is a trivial consequence 
of Lemma 3.1.4 (ii). I 

Proof of Lemma 3 . 1 . 4 * (0* We prove the statement for I^{x^U) (the 
proof for J^{x,U) is analogous). The exponential map is a diffeomor- 
phism of a neighbourhood U of Ox G TxM onto U. For any geodesic 7 
we obtain V^(p(7,7) ~ 25^(V^7,7) = 0 whence the velocity vectors of 
geodesics do not change their causal class. It follows that exp^ maps 
timelike vectors into J+(x,ZY). We have to show that for each point 
y e I~^(x^U) the vector (exp3.)“^(y) — ve TxM is necessarily timelike. 

The double cone Cx = {v e U : g(v,v) = 0} divides U\Cx 
into 3 connected components: the future and past full cones of timelike 
vectors (7°’“) and the set of spacelike vectors (<5°’^). Applying 

the diffeomorphism exp^: U U see that the set CxiJA) ~ {z ^ 
U \ '^v ^ Cx with 2: — exp^(u)} divides U into the sets 

respectively. For every y G I'^{x^U) there is a timelike curve 
7: [a, 6] Z//, t i-> 7 which connects x and y. From g{j{a)^'y{a)) < 0 

we know that 7 must initially enter If the assertion does not hold 
then y 6 C^{L() U Since /"^(o;,ZY) is open we can assume without 
loss of generality that y e Hence 7 must intersect Cx at some point 
7(to) where 7 leaves Since 7(^0) is timelike and future directed it is 
transverse to Cx at 7(^0) points into . But this is a contradiction 
to the construction of the point 7(^0)* 

For (ii) it is sufficient to note that the set of causal vectors in TxM 
is the closure of the set of timelike vectors in T^M. I 
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Since a Lorentzian conformal structure allows to measure angles and 
relative lengths (at a given point), it is sufficient for spatial geometry at 
one point. We just loose an absolute calibration. 



3.2 Inertial observers: the projective structure 

So far we only have considered light propagation. In order to specialise 
our structure further we must take into account other fundamental prop- 
erties of nature. In Chap. 1 inertial observers have played an important 
theoretical role. Since they are not subject to any physical forces one can 
physically implement inertial observers (or particles) by freely falling ob- 
servers (or particles). We will use freely falling or inertial observers as the 
other input into our theory besides the Lorentzian conformal structure 
induced by light rays. 

The following postulate reflects that the movement of an inertial 
observer depends on his/her initial velocity and initial position. This is 
the main content of Galilei’s law of inertia. 

Postulate 3.2.1 (Existence of inertial observers). Through any 
point X £ (M,g) and for any timelike direction Ezi there exists (up to 
parameterisation and extension) exactly one inertial observer 7: E ^ M 
which passes through x with velocity 7 || En. 

Postulate 3.2.1 singles out a collection of paths in spacetime. In Minkows- 
ki space, inertial observers move along straight, timelike lines. This is 
again a global characterisation which we need to overcome by formu- 
lating it infinitesimally, i.e. in the tangent bundle rather than in space- 
time itself. An infinitesimal description of ( unparameter ised) inertial ob- 
servers in Minkowski space is that their spatial acceleration vanishes. 
This property can be generalised as follows. 

Postulate 3.2.2 (Law of inertia). For each x € M there exists a 
chart {Ux<fx) centered at x such that with respect to the corresponding 
coordinate system we have for all inertial observers passing through x 

dl* " ^ ■ 

at X. The chart maps (fx' h(x depend smoothly on the parameter 

x. 

While by its very formulation Postulate 3.2.2 is independent of the cho- 
sen charts, we need to express it in an arbitrary coordinate system. In 
Sect. 2.6 we have seen that the derivative of vector fields is coordinate 
dependent and therefore not well defined in a general manifold. How- 
ever, we can use Theorem 2.6.1 and Corollary 2.6.1 in order to define a 
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connection with respect to which Postulate 3.2.2 can be formulated in a 
manifestly coordinate free manner. 

Let (W, be any chart and x We define the Christ offel symbols 

with respect to this chart at x by 

_ dx^ d'^x^ 
dx^ dx^dx^ * 

where are the coordinates with respect to the chart (px) 

provided by Postulate 3.2,2. We will now prove that this construction 
gives a well defined connection V. Let (U^p) be a second chart with 
x £U, Then we have ~ show that these two 

definitions give the same connection. Indeed, we calculate 

_ dx^ d^x^ 
dx^ dx^dx^ 

_ dx^ dx^ d / dx^ dx^ \ 
dx^ dx^ dx^ \ dx^ dx^ ) 
dx^ dx^ / d'^x^ dx^ dx^ dx^ &^x^ \ 
dx^ dx^ ^^dx^dx"^ dx^ dx^ dx^ dx^dx^ ) 
dx^ dx^ dx^ ( dx^ &^x^ \ dx^ 

dx^ dx^ dx^ dx^dx'^ J dx^ dx^dx^ 

dx^ d^x^ dx^ dx^ dx^ p^ 

dx^ dx^dxfi dx^ dx^ dx^ 

which is exactly the transformation formula provided by Corollary 2.6.1. 
Thus we have a well defined torsion- free connection F and Postulate 3.2.2 
can be restated as follows. There is a connection V such that inertial 
observers are pregeodesics. 

Given our collection of inertial observers, {{Uxi^ijo)}xeM is not the 
only collection of compatible charts such that the formula in Postulate 
3.2.2 holds. In fact, Corollary 2.6.3 implies that exactly those collections 
{(Vx, '4>x)]xeM which induce torsion- free connections that have the same 
pregeodesics as V are also possible choices. This implies the following 
corollary. 

Corollary 3.2.1. Postulates 3,2.1, 3.2,2 determine a projective struc- 
ture ^ such that each particle is a pregeodesics with respect to any V € ?p. 

Weyl characterised the connection as a field which forces a particle to 
be transported parallelly with itself in space and time. Thus we have 
arrived at a geometrical explanation for the law of inertia postulated by 
Galilei. 

^ In order to understand the corollary below we need to know a little bit more 
about projective structures. 
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3,3 Compatibility: Weyl structure 

We have obtained a Lorentzian conformal and a projective structure but 
the relationship of these two geometrical structures is still unspecified. 
We will now introduce a further postulate which links observers and 
light rays and therefore these two structures. It is an experimental fact 
that one can chase light rays with material observers arbitrarily closely, 
provided one uses enough energy The following is a formalisation of this 
idea. 

Postulate 3.3.1 (Compatibility with the causal structure). 

Each X £ M has a neighbourhood U such that for all y e U \ {a:} we 
have 

y ~ 7(t) for an inertial observer 7 through x ^ y e {x,U)Ul~ (x^U). 

As a first consequence of this compatibility axiom we can determine light 
rays using the connection instead of the Lorentzian conformal structure. 

Lemma 3.3.1. The conformal null geodesics coincide with those pre- 
geodesics which are somewhere lightlike. 

Proof Let x e M and let U be the intersection of the neighbourhoods of 
X which are provided by Proposition 3.1.1 and Postulate 3.3.1. Let /i be 
a conformal null geodesic from x to some fixed point y gU. Proposition 
3.1.1 implies that fi lies in the boundary of I^{xM)> Hence there is 
a sequence of points yi e I~^{x^U) which converges to y. For each i 
let 7i : [0, 1] U he the pregeodesic which corresponds to the inertial 
observer which moves from x to yi (cf. Postulate 3.3.1). Let v \ {0} be 
an accumulation point of the (bounded) sequence 7i(0) ^ TxM and let 
7 be the pregeodesic with 7(0) = v. By the continuous dependence of 
solutions of differential equations on initial conditions and parameters 
(cf. Theorem 2.4.1) there are for each point 7(5) (s € [0,1]) and each 
neighbourhood V of 7(5) infinitely many pregeodesics 7^ which intersect 
V and whose velocity ve ctors at s converge to 7(^)* This implies that 
7 is causal and that 7 C /+(a:,lY). Since yi y and U is compact the 
pregeodesic 7 reaches y. T he inclus ion J^{I^{x.^U),U) = I^{xM) and 
t/ 6 7 n C+(W) imply 7 C I‘^{x^U) \ I^{x,U) = C+(W). Since is a 
null hypersurface and 7 is lightlike (causal but not timelike) 7 must be 
a conformal null geodesic. That 7 coincides with // follows now from the 
uniqueness of conformal null geodesics. 

For the converse we simply need to note that both pregeodesics which 
have an initial lightlike velocity vector and conformal null geodesics 
are uniquely determined by initial point x and initial velocity direction 
R7(0). I 

In the rest of this section we will show that our postulates imply the 
existence of a natural Weyl structure. 
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Theorem 3*3*1, Let C be a Lorentzian conformal structure and ^ be 
a projective structure such that the Postulates 3,1,1, 3.2,1, 3.2.2, and 
3.3.1 are satisfied. Then there exists a unique V € ^ such that for all 
g G (L there is a one-form cp with Vg = g. 

The proof of this theorem will be split into several lemmas. 
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Lemma 3*3*2. Let g be a Lorentzian metric and Aabc = ^{abc) be a 
totally symmetric tensor such that A{v,v,v) = 0 for all null vectors v. 
Then there exists a one-form such that A(^abc) = '^(a9bc) ♦ 

Proof. sym(t? 0 p) clearly satisfies the condition of the lemma. We will 
now verify that this is the only possible choice. 

Let t be a vector with ^(t, t) = —1 and e E t“^ with g{e, e) = 1. Then 
i±e are null vectors and from 



Z\^(t ± e, i ± e, t ± e) = A^^bc) ± 3t“t'’e" + 3t“e'’e<= ± e“e'’e‘=) = 0 



we obtain 



0 = ^(a6c) + 3i“e'’e^) , (3.3.1) 

0 = A^^bc) (3t“t^e<= + e^e'-e^) . (3.3.2) 



Setting 

= - (3Z\(„(,c)t“t*’ + 2AH,i,t)i%c) ■ 

Equation (3.3.2) is equivalent to 

A{e, e, e) = -3(A(abc)i“i'’e'') = i?(e) = g(e, e)i9(e) Ve 6 t"^. 
Analogously, Equation (3.3.1) is equivalent to 

^(a6c)e“eH“ = = ii?(t) = 

Finally, the definition of '3 implies 



and 

Aa&c)t“i*’i‘= = 5(t,t)^(t). 

By the polarisation identity for symmetric 3-tensors, we know that A(^abc) 
coincides with g^ab'^c) ^ basis and our claim is proved. I 



^ The proof of Theorem 3.3.1 is rather technical. It can be omitted without 
loss of continuity. 
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Proof, Let {Eq^ . . . , En-i} be an orthonormal basis and 
be the dual basis. We can write L — ® ® ® Ea^ where = 

for all indices a^b^c. We consider lightlike vectors given by iV — 
Eq + cos{0)Ea + sin{0)EB, where A ^ B and 6 e [0,27t]. In order to 
exploit the condition L(AT, N) — c{N)N^ we expand L(iV, N) as a Fourier 
polynomial in 6. Using cos^0 = |(1 -j-cos(20)), sin^cos^ = ^sin(20), 
and sin^ 5 — |(1 - cos(20)), we calculate 

L{N, N) = L{Eo, Eo) + cos\e)L{EA, Ea) + sm^{e)L{EB, Eg) 

+ ^ 008(^9) L(^Eo, E a) 2 sin(0)r(£o) -®s) 

+ 2 sin(0) cos{9)L{Eaj Eb) 

= L{Eo, Eo) + i {L{Ea,Ea) + L(Eb,Eb)) 

+ 2L{Eq^ Ea) cos{9) H- 2L{Eq^ Eb) sin(0) 

+ i {L{Ea, Ea) - L{Eb, Eb)) cos(20) + L{Ea, Eb) sm(2e) 
= c{9) (Eo + cos{9)Ea + sin(0)jFB) 

for some function c{9). The left hand side is a Fourier polynomial of 
order 2 which implies that c must be a Fourier polynomial of order < 1 
since otherwise the right hand side would be a Fourier polynomial of 
order > 3. Hence we can write c{9) = aAB + Pab cos9 + ^ab sin^. The 
right hand side is then given by 



1 1 

o^abEo + -PabEa + -jlABEs + {PabEq + aAsEA) cos 9 



+ {jabEq + ocabEb) sin9 + 



-^PabEa ~~ -JabEb^ cos(20) 

+ (^-'JabEa + -^f^ABEs^ sin(20). 



A comparison of coefficients gives 

L{Eo, Eo) + i {L{Ea, Ea) + L{Eb, Eb)) 

— olabEo + -^PabEa + ■^'JabEb, ( 3 . 3 . 4 ) 



2L{Eo,Ea) = PabEq + aABEA, ( 3 . 3 . 5 ) 

2L{Eo, Eb) — IabEo + aABEs, ( 3 . 3 . 6 ) 

E{Ea, Ea) - L{Eb, Eb) = PabEa - JabEb, ( 3 . 3 . 7 ) 

L{Ea>Eb) = —‘JabEa + -PabEb- ( 3 . 3 . 8 ) 
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We obtain a linear system of equations for the components Ll^ (6 < c) 
of L = Prom Equation 3.3.5 we obtain immediately 

Lf^=0forB^{0,^}, 

^OA = 2^ab^ = -aAB- (3.3.9) 

Further, neither 13 ab nor aAB can depend on B since the left hand sides 
of Equations (3.3.9) are independent of Eb- Equation 3.3.6 im plies in 
addition that = ^aAB- Hence aAB cannot depend on A either. We 
will therefore write Aq := aAB and Aa := ^ab- Equation 3.3.8 implies 
= 0 for A J5 and = 0 for pairwise different A, B, C. We 
also obtain — ^jab and — ^Xa- These two equations are 
only consistent if jab = Xb- Equation 3.3.7 can be used to eliminate 
L{Eb,Eb) in Equation 3.3.4 resulting in 

L{Eo,Eo) + L{Ea,Ea) — aABEo + -^0 abEa + ^'^abEb + -^PabEa 

~ "^IabEb — XqEq + XaEa- 

This equation implies Igo + ^aa = which is independent of A. Hence 
^AA = L%b for a ^ B we can set := We also have Lfo = 

~^AA for B ^ A and Lq^+L-^a = If we set fj,^ = Lqq, all coefficients 

are determined. It is now straightforward to check that = 5?j,Ac) + 
Conversely, this tensor indeed has all the properties listed in the 
lemma. | 

Corollary 3.3.1. Let L{-, ■): TxM x T^M — *■ T^M he a symmetric ten- 
sor such that L{N, N) || N for all null vectors N and g{L{v, v), u) = 0 
for all vectors v. Then for each g e€ there exists a 1-form X such that 

I 

L{v, '^) = 2 + X{w)v) - g{v, Vt;, w e T^M, 

where is defined by X{v) = ^(A^ v) for all v e Ta^M, 

Proof. This follows immediately from Lemma 3.3.6 and the additional 
condition g{L{VyV)v) — 0 for all vectors v. I 

We are now ready to prove the main result of this section. 

Proof of Theorem 3.3.1. We will first show that there exists a unique 
V € ^ such that for each representative ^ £ there is a one-form ip 

with 



{Vywy = V%w- + (dbgdc + - ddgbc) 
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+ \fd9bc - 9d(b<fc))V’W. (3.3.10) 

Recall the formula for A provided by Lemma 3.3.5. Since satisfies 
the assumptions of Corollary 3.3.1 there exists a one- form A such that 
La{bc) = 9a{bK) - Aapbc. The property = 0 implies 0 = g^^La{bc) = 
~{n + 1 )A 5 — Xb — |(n “ 1 )A 5 , whence A = 0 and therefore L = 0. Hence 
we have 



^abc ~~ 2 {^b9ac “h ^c9ba ~~ ^a9hc) ~~ ^bc 

1 

“ ~2^<^9hc ” 9d{b^c) "I" 9a{b^c)' 

An application of Lemma 2.6.3 implies that there is a unique V € ^ 
such that 0 = 0, This proves Equation (3.3.10). 

From Equation (3.3.10) we obtain 

^ a9bc — 9a9bc “ ^ab9dc “ ^ac9db 

” 9a9hc ~ 2 i^a9cb “1* ^b9ac ~ dc9ab) ~ ‘^^c9ab d* 9c(a^b) 

"2 (^ctpbc "b ^c9cib ^6pac) ~^^b9o,c "b 9b{a9^c) 

^ ^a9bc' 

Hence (M, C, V) is a Weyl structure and the theorem is proved. I 
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There are experimental facts which indicate that a general Weyl struc- 
ture has features which have no counterpart in our actual universe. It 
seems therefore necessary to specify the geometrical structure of space- 
time further. 

It is plausible to identify an affine parameterisation (cf. Definition 
2.7.4) of an inertial observer with a standard clock carried by the ob- 
server. The freedom t at-\-b corresponds to the freedom to choose the 
zero on the time axis and to choose the unit in which time is measured. 
An atomic clock roughly works at follows (detail of this mechanism can 
be found in textbooks on Quantum mechanics). Each atom has a charac- 
teristic minimal energy E which it can absorb. A very short while after 
the absorption of such a package of energy the atom will emit a photon 
whose frequency jy with respect to the rest frame of the atom is given 
by E = hi^. Since this frequency is characteristic for each sort of atom it 
can be used to build a clock. 




3.4 Reduction to the Lorentzian structure 167 



In Sect. 1.4.3 we have seen how to calculate E from the world lines 
of the atom and the photon in the context of special relativity. Unfortu- 
nately, we cannot simply apply this calculation here since the number E 
did depend on the Minkowski metric rj and not merely on its conformal 
class. But it is very suggestive to identify this atomic clock with the 
standard clock t given by the affine parameter of the world line of the 
atom. 

We will now see that this identification gives rise to a global effect 
which has not been observed. Let x^y £ y ^ be two events 

and consider two atoms of the same kind which are moving from x to 
y along different paths 71 : [0, cvi] ^ M, 72 : [0, 0^2] M in spacetime 
(cf. Fig. 3.4.1). We will assume that they move initially along the same 



y = jiiai) = 72 ( 0 : 2 ) 




Fig. 3.4.1. The world lines from a? to y of two 
atoms which are initially and finally at rest with 
respect to each other 



path in spacetime and that their clocks are initially calibrated at x, i.e., 
there is an e > 0 such that 71 (^) = 72 (t) for all t e [0,e). We will also 
assume that just before reaching y they are again moving side by side. 
For these observers we obtain 



f 

Jo 



pA (7A(i),7AW) 

Jo 






+ 2 






dt 



f 



g{iA{t)nA{t)) 
g{iA{0nA{t)) 



ip{jA{t))dt. 
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Since the frequencies were initially equal we obtain 

In (5(7i(ai),7i(o:i))) - In (fl>(72(a2),72(a2))) 

= In (ff( 7 i(ai), 7 i(ai))) - In (3(71(0), 71(0))) 

+ In (3(72(0), 72(0))) - In (3(72(0:2), 72(^2))) ■ 



Let Q be any 2 -surface which is bounded by the curves 71 , 72 . An ap- 
plication of the theorem of Stokes (Theorem 2.5.5)^ gives 



In (3(71(01), 7i(ai))) - In (3(72(02), 72(02))) 

pat pa2 

= / <3(7i(t))di - / 

JO Jo 

^ [ dcp^ -2 [ F, 

JQ Jo 



Hence the parameterisation of both curves and therefore — by our iden- 
tification — the frequencies of both atoms are different at even though 
they were the same at :r. As a consequence, the frequency of an atom 
clock would depend of the history of the atoms constituting it. This does 
not seem to be the case. Moreover, the spectrum of far away stars is ap- 
parently independent of the history of the atoms which constitute these 
stars.® Hence we conclude that d^/? = 0 . 



Postulate 3.4.1 (No second clock eflfect). The length curvature 
F — —^d(p of spacetime vanishes identically. 

Notice that the justification of Postulate 3.4.1 requires more interpre- 
tation than the justification of our other axioms. One may argue that 
it is the weakest link in the chain of arguments which leads to general 
relativity. 



Corollary 3.4.1. Assume that Postulates 3.1.1^ 3.2.1, 3.2,2, 3.3.1, and 
3.4^1 hold. Then spacetime is a Lorentzian manifold (M,g). 

Proof. This follows immediately from Theorem 2.7.2. I 



We have now arrived at a geometrical structure which gives the frame- 
work for a description of space and time. The arguments which lead 
to Corollary 3.4.1 may seem so compelling that the reader could ask 
herself or himself why we started with Newton’s theory of spacetime 

® The gist of our argument is that the difference of the frequencies at 71 (ai) = 
72(0:2) is non-zero. That this is the case for suitable paths 71, 72 unless <p 
can be chosen to vanish should be plausible even without appealing to the 
theorem of Stokes. It is needed for a strict proof though. 

® This argument does not depend on the identification of atomic clocks with 
the affine parameter of their world lines. 
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instead of motivating our postulates directly. In fact, from a purely con- 
ceptional point of view it is advantageous to analyze the measurement 
of space and time relations and to use this analysis in order to arrive 
at a Lorentz structure. This program has been carried out by Ehlers, 
Pirani, and Schild (1972) who also arrive at a Weyl structure and reduce 
it to a Lorentz structure via Postulate 3.4.1. With our preparation this 
article is highly readable and certainly recommended to physicists who 
are interested in the operational approach. 

We have used a more historic approach for two reasons. 

Firstly, most readers are familiar with the classical description of 
spacetime (albeit less formalised, perhaps). 

Secondly, Newton’s theory is also very compelling on first sight. So 
are Galilei’s theory and the special theory of relativity. What is more, 
when these theories where still young and generally accepted it was very 
difficult to see how to improve them. In fact, most physicists and philoso- 
phers would have claimed that these theories are correct in an absolute 
way. There is no doubt that the Lorentzian description of spacetime 
will not be the last improvement either. It is even a prominent topic of 
current research to try to incorporate general relativity into a new gen- 
eral quantum theory of spacetime. It is generally believed that this new 
theory will be qualitatively very different from the geometrical theory 
we have presented here. The reader should recall that we started with 
macroscopic properties of rays of light which do not take into account the 
quantum nature of light. Also, we have always assumed that space and 
time are continuous rather than discrete. Hence there are several points 
where our theory of spacetime may prove inadequate. It must also be 
said, however, that a conceptionally satisfying theory of quantum gravity 
does not yet exist. 

Our description of spacetime is much better than previous theories 
and to date there is no other theory which describes the global properties 
of spacetime better. 
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4. Pseudo-Riemannian manifolds 



We have learned that spacetime can be described by a Lorentzian man- 
ifold. In this section we will investigate the slightly more general case 
of pseudo- Riemannian manifolds in detail The development of the 
theory of spacetime will be continued in Chap. 5 where we motivate 
Einstein^s equation, the central equation in general relativity which 
links matter to gravitation. 

Readers who wish to get to Einstein’s equation quickly may skip 
most of Chap. 4- They only need to read Definition 4^2.2 and Sect 4-^ 
up to and including corollary 4-^C. 

For mathematicians, this chapter contains the essentials of (pseu- 
do) -Riemannian differential geometry. Almost everything we present 
here will he used in the following physically motivated sections. 

Prerequisitives of this chapter: Sect. 2.7.1 and Sect 2.8 (up to but 
not including Lemma 2.8.2). 

Recall that a pseudo-Riemannian manifold (M, g) consists of an n- dimen- 
sional manifold M and a symmetric, everywhere non-degenerate ( 2 )- 
tensor field g. We will often denote g by (•,•), A pseudo-Riemannian 
(M^g) is called a Riemannian manifold (respectively, Lorentzian man- 
ifold) if g has signature is +) (respectively, (-, -h, . . . , +)). The 

simplest example of a Riemannian manifold is Euclidean Space, 
d(a;^)^ H {- d(a;’^)^), and the simplest example of a Lorentzian mani- 
fold is Minkowski spacetime, (M’^, —d(o;^)^ + d(a;^)^ H -1- d(a;’^"'^)^). 

In this book, we are especially interested in Lorentzian manifolds 
as mathematical models of spacetime, Riemannian submanifolds (cf. 
Sect. 4.4) of codimension 1 can be thought of as instants of time. They 
will play an important role when we discuss the initial value problem 
in Sect. 5.4, Pseudo-Riemannian manifolds which are neither Lorentzian 
nor Riemannian are rarely applied in physics. However, it does not come 
at any additional cost to widen the discussion to this more general case. 

Unless explicitly stated otherwise all geometrical objects are under- 
stood to be derived from the metric and the Levi-Civita connection. 

Remark 4^0.1. The investigation of hypersurfaces in Euclidean space has 
a very long tradition in mathematics and it has led to many important 
(mathematical) developments. (Pseudo)-Riemannian manifolds are the 

^ We only collect those facts which are essential to an understanding of Ein- 
stein’s equation which will be presented in the next chapter. 



p. 169 I 
[i P> 174] 



M. Kriele: LNPm 59, pp. 171 - 254, 1999 
© Springer- Verlag Berlin Heidelberg 1999 





172 4. Pseudo-Riemannian manifolds 



natural generalisation of these hypersurfaces and therefore of indepen- 
dent interest to mathematicians. While we will not push this angle, read- 
ers primarily interested in learning differential geometry should keep in 
mind the following example of a Riemannian manifold. 

Let M C be a hypersurface and consider for each x € M the 
tangent space TxM as a subspace of R”'. To be concrete, let M R^ 
be the natural inclusion and identify TxM with i^TxM c Tt(a,)R’^ « R^. 
We denote the standard scalar product of R^ by (•, and define the 
Riemannian metric of M by 

. g{v,w) = (t*v, 
for all vectors v^w ^ TxM, 

While this class of examples is rather simple, it is sufficient for vi- 
sualising most important features of Riemannian manifolds. Whereas 
Euclidean space is trivial in the sense that d(x^)^ + • • * + d(x^)^ is a 
constant tensor field with respect to appropriate coordinates, g is non- 
constant in general and the curvature of its Levi-Civita connection does 
not vanish. 

Example 4-0,1. The simplest, non-trivial example is given by the sphere 
52 = {x e R^ : (xi)2 -f (x2)2 + (o;3)2 = 1} c R3. Denote by t: ^ R^, 

X X the canonical inclusion. In this case we have TxM — {y e : 
{x,y)^s = 0}. We can parameterise (a dense open subset of) the sphere 
using the chart (U^(p) where 

( cos (f) cos 0\ 
sin(f)cos6 , 
sin 9 J 

Let {j^i, £^ 2 } be the standard orthonormal basis of R^. Prom 

we obtain gee = g{dej de) ~ 1, geip — 0, = cos^ 9. We could now use 

the Koszul formula (Equation (2.7.7)) to calculate the Levi-Civita con- 
nection and we could determine the Riemann tensor through the formula 
given in Theorem 2.8.1. In Sect. 4.4 we will study general submanifolds 
and present better techniques for calculating these quantities. 

A submanifold of Minkowski space does not necessarily inherit a Lorentz- 
ian metric — in fact, the example (^) \ {^} shows that a hypersurface 



— cos (j) sin 9^ 

— sin (j) sin 9 

cos 9 

— sin (j) cos 
cos (j) cos 9 

0 
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in Minkowski space may not inherit any pseudo-Riemannian metric. In 
contrast to the Riemannian case, it is probably not wise to try gain- 
ing intuition for Lorentzian manifolds from studying submanifolds of 
Minkowski space. We should instead use the intuition we gained in the 
previous chapters. Lorentzian manifolds serve as models for space and 
time as a unit — and in this physical way their geometry can be under- 
stood best. 

Since the metric of a pseudo-Riemannian manifold is everywhere non- 
degenerate we have a canonical isomorphism of vectors and one-forms. 

Lemma 4.0.1* Let (M^g) he a pseudo-Riemannian manifold. The met- 
ric induces an isomorphism (*)^ : T^M T*M, v ^ where v^{w) = 
gx{v,w) for all w G TxM. 

Proof. This follows immediately from the fact that g is non-degenerate. 



We denote the inverse isomorphism by (-)^ : T*M T^M. This isomor- 
phism can be naturally extended to tensor fields. 

Definition 4.0.1. , Let 'll) e Tf{TxM). Then we define the completel'y 
covariant tensor 'ip'^ by 

• • • ) '^5+r) — • • • ? ('^s+l) ? ■ • • ? ('^s+r) ) 

and the completely contravariant tensor 'ip'^ by 

. . . ,0;'-+^) = V’(K)", • • • , 

The components of 'ip^ are often simply denoted by the 

components of 

The isomorphisms (-)^, (*)^ are often referred to as “raising and lowering 
of indices”. This terminology is motivated by their expression in the 
abstract index notation. We write {g^)ab = {v^)a ^ 

and = g^^^h =’ The symbols “|j” and “b” should be easy to 

remember since there is an analogous notation in music. 

One of the most important inequalities in linear algebra is the Cauchy- 
Schwarz inequality for positive definite scalar products (♦, It states 
that for every pair of vectors u, w the inequality 

I I < 

holds. This inequality clearly generalises to Riemannian manifolds. To 
us, an analogous inequality for Lorentzian manifolds is much more im- 
portant. 
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Lemma 4.0.2 (Inverse triangle inequality). Let{M,g) beaLorentz- 
ian manifold and let v,w be causal vectors. Then the inequality 

I {'^,'w) I > \/i (v,v) \ ^y\ (w,w) I 

holds. 

Proof. The inequality holds trivially for null vectors. Hence we can as- 
sume that both w and v are timelike. There is a vector e J. v and a 
number a such that w = av + e. This implies (e, e) > 0, 0 < (w, w) = 
o? (v, v) -b (e, e) and therefore 

{v,wf = (v,vf = i{w,w) - (e,e)) {v,v) > (w,w) {v,v) . 

[p- iTi 111 I 

i p. 176 

In the rest of this section we prove a somewhat technical lemma which 
may be omitted on first reading. 

For any u G the n-dimensional Vertical” subspace Tu.,.TxM of 

Tu.,TM is given by the image of the map T^M ^ TuTM, v ^ i)[^| ;= 
■^{u -h We can equip Tu.^T^M with a pseudo-scalar product by 

defining ) = (v,w). In general, the push forward of the expo- 

nential map fails to be a linear isometry of the spaces Ty,.,TxM and T^M. 
However, if one of the vectors is aligned with we have the 

following invariance. 

Lemma 4.0.3 (Gaul3 lemma). Let x G M, u e T^M \ {0}, and 
W[u] € TuJTxM. If there are 6 C T^M and w G T^M with = 

-F then 

(T^exp(u[^]),T^exp(u)[^])) {v,w) . 

Proof The assertion is a formula which is linear in and . Since 
u and V are parallel we can therefore replace u by n in this formula. 
Consider the map /: (s, t) exp^{t{u + sw)) G M and let 

/ f 

Observe that we have V Q^ft ~V Q^fs for all s since V is torsion free. 
The curve t f-> exp^{t{u + sty)) is a geodesic, which implies 
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1 / 1 / 

= 2 9s = 2 9^(m + s«J,u + s «;) = (u,w). 

This equation can be easily integrated yielding (/*,/«) — t (u, w). Hence 
the assertion follows by setting (s,t) = (0, 1). I 



4.1 Existence of Lorentzian 
and Riemsinnian manifolds 

This section is included for its theoretical interest and can be omitted 
on first reading. 

Theorem 4.1.1. Every manifold M carries a positive definite metric g. 

Proof. Let {(Ua, ^a)}a€A} be a collection of charts which covers M and 
{fa}a€A be a partition of unity subordinate to {Uof]ot^A> The bilinear 
form 

•” ^ ^ h 0 dx^. 

is evidently well defined and symmetric. Let j be an index with x e 
supp(/o:^.) and v G T^M be any non-zero vector. The estimate 

9x{v,v) = Y, > 0 

oi^A i=l . i=l 

implies then that gx is positive definite. I 

A line field (or line bundle) is a 1-vector subbundle with base manifold 
M of the tangent bundle TM. A nowhere vanishing vector field U gen- 
erates a line bundle but it is possible to have line bundles which are not 
generated by vector fields (cf. the example of a Mobius band). 

Theorem 4.1.2. A Manifold M carries a Lorentzian metric if and only 
if it admits a (nom oriented) line field L 

Proof Let be a Riemannian metric of M and assume that there exists 
a line field I on M, Each point x e M has a neighbourhood U such 
that for all y e U the line field I can be represented by ly — R{7^, 
where C7 is a vector field on U. We can assume without loss of generality 
that h{U^U) — 1. Then U is determined up to a factor ±1. Hence the 
( 2 )-tensor field g h — 2U^ 0 is globally well defined. We restrict 
now attention to U again. Let {C/ 2 , . . . , C/n} be a completion of C/ to a 
local orthonormal frame with respect to h. That the tensor field g has 
signature (~, -h, . . . , +) and is therefore a Lorentzian metric follows from 
g{Uu Uj) = h{Uu Uj) = J,,-, g{U, Ui) = 0, g{U, C/) - 1 - 2 - -1, 
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Conversely, assume that M admits a Lorentzian metric g and let h 
be a Riemannian metric on M. Then at each point x e M we have a 
linear map -4: T^M TxM defined by = g'^^hjk, where we use the 
Lorentzian metric g to raise or lower indices. The tensor A is symmetric 
with respect to h since 

h{v,Aw) = hkiv'^A^w^ = hki 9 ''‘hjiv'^w^ = hjiA].v'^tiA = h{w,Av) 

for all vectors v, w. Hence there is an h-orthonormal basis of eigenvectors 
of A. Since g has signature (-1, 1, . . . , 1) the linear map has one negative 
and n - 1 positive eigenvalues. It follows that a each point x there is a 
uniquely determined 1-dimensional subspace of TxM which is spanned 

by the eigenvectors corresponding to the negative eigenvalue. Clearly, I 
defines a line field of M. | 

Corollary 4.1.1. The Sphere 5^ = {x € + = 1} 

does not admit a Lorentzian metric. 

Proof. If there is a Lorentzian metric on then there is also a line 
field Z on 5^. Let Zi be a Riemannian metric. Then there are at each 
X € 3“^ exactly two vectors ±Z7x € lx with h{Ux,Ux) — 1. We will 
now construct a non- vanishing vector field V on 5^. At x we choose 
Vx = Ux. and consider all great circles through x. These great circles 
cover all of 5^ and each point but {±x} is intersected by exactly one 
such great circle. The points x and -x are intersected by all great circles. 
We define now F on 5^ \ {-x} as follows. Along each great circle 7 
we let V^(t) € be uniquely determined vector such that 

1 Z7y(t) is smooth. These vector fields along great circles have each a 
limit vector in {U-x, -U-x}- Let now 71 : [0 , tt] -4 5^, 72 ; [0 , tt] — > be 

two great arcs from x to — x. Since we can smoothly rotate one arc into 
the other and V depends smoothly on parameters it is clear that both 
limits limj—,^ V!yj 6 {tZ—x, U—x} and € {C/_x, — C/_x} 

must coincide. But this implies that all arcs have the same limit at {— x} 
and that therefore we have defined a continuous, non- v anishin g vector 
field V. This contradicts Theorem 2.5.6 and therefore there cannot be a 
Lorentzian metric on 5^. | 

We will see in Proposition 8.1.1 that there are other reasons why one 
may want to discard compact models of spacetime. 
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In Sect. 2 . 5.4 we have seen that for a general manifold without addi- 
tional structures it is possible it to define an integration of n-forms 
but not an integration of functions. In an oriented pseudo-Riemannian 
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manifold there is a canonical n-form, the volume form. This allows 
us to define the integral over a function (cf. Definition 

The Hodge star operator is a canonical isomorphism of p-forms 
and n — p forms. It can be used to put Maxwell’s Equations which 
describe electromagnetism and light (cf. Sect. 5.2.3) into an espe- 
cially simple form. A further motivation is given in the introduction 
to Sect 2.5.1. 

The discussion of the Hodge star operator can be omitted on first 
reading and will not be needed in the rest of this book. 

This section draws on Sect. 2.5 

The isomorphisms b, jj (cf. Definition 4.0.1) induce a pseudo-scalar prod- 
uct on TJM. 

Lemma 4.2.1, The bilinear form 



X -> r, 



is a non- degenerate, symmetric bilinear form. The pseudo- scalar product 
g[^] is positive definite if g is positive definite.'^ 

Proof For £ Tf{TxM) we calculate 



(-0^ (g) (f>^) 



d\ . . 
6i , , 



__ lai-.-an 

- Vc.+i...c,+,.<Pb.+i...b„+,.9 ■■■9 

X gbidf- 9b. d.. 






It follows that the total contraction of this tensor is symmetric in its 
constituents and Let 

be a pair of dual orthonormal bases and ^ G TJ (T^M). Since (w*)** = ±ei 
and (e^)^ = (the sign depending on the signature and i) we have 

fl'IsKV', fiai <8i • • • ® ® ® • (8 w”’-) = ±x/>(e„, , . . . , , w”’ ) 

which in turn implies that g[l] is non-degenerate. The assertion for Rie- 
mannian metrics follows from 

5[sl(V'>V’)= XI 

Ol . . j<Xf» ) 

br,...,br, 

Cl > 

f... fd^- 

X • 5 ^Ci, 5 ^ ^ . 5 )^aici * • ‘ ^diiCtt^bidi • • • ^bj.dr* 



^ The converse is not true since g{n] is always definite since it is a non- 
vanishing bilinear form on a one-dimensional space. 
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The metric measures not only the length and the angle of vectors but 
also the volume of a parallel-epiped spanned by a collection of linearly 
independent vectors. 

Lemma 4.2.2. Let (M^g) be a .pseudo- Riemannian manifold and x € 
M. Then x has a neighbourhood U such that there exist exactly two n- 
forms ja, -jx with |5'[n](M±5M±)l ™ Furthermore^ if {Ei^ . . , ^ E^} is 
an orthonormal frame then .... En) € {-1,1}. 

Proof The first claim follows from the fact that A^{TxM) is a 1-dimensio^ 
nal vector space and g[^]x a non- vanishing bilinear form on it. 

Let [El, ...,En] be an orthonormal frame and be the 

dual frame. Then the tensor fields cp± — ±cj^ (g) ■ - • 0 satisfy 

|y[n](v^±.V^±)l = 1- 

Hence p± = A ^ ^ ^ = dm! alt(v?) satisfies \gZ]{F±^T±)\ = n\. If 

{Eu.,,,En] is any other orthonormal frame and A e tA{U) is defined 
by Ei = AEi = A\Ej then ^±(^i, . . . , = det(A})/x±(£;i, . . . , E„) = 
det(^p. The second claim follows since the determinant of a linear map 
which transforms orthonormal bases into orthonormal bases is always 
equal to either 1 or —1. | 

The preceding lemma implies (by continuity) that on a connected pseudo- 
Riemannian manifold there are at most 2 n~forms ±pL which satisfy the 
normalisation condition ib)| := n!. Evidently, this is the case 

if and only if there exists a non- vanishing n-form on M, i.e., if M is 
orient able. 

Definition 4.2.1. If[M,g) is an oriented pseudo- Riemannian manifold 
with orientation O then the volume form is the unique n-form fi ]\4 G O 
Mm)| “1. IfUcM is an open set with compact closure 
we define the volume ofU by vol(ZY) = 

The following definition reduces to the volume integral in Euclidean ge- 
ometry 

Definition 4.2.2. Let (M,g) be an oriented pseudo- Riemannian mani- 
fold with volume form fx. Then the Integral of a function / over an open 
set U is defined by fjxfi 

^ Reader s who hav e omitted Sect. 2.5 can replace f^ fXM by 

S<p(U) . . . dx'^, where (p is chart map whose domain 

contains U. 

^ As in the preceding footnote, f/x may be replaced by Mm by f o 
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Lemma 4.2.3. Let {M^g) be an oriented pseudo-Riemannian manifold 
and . . . ^x'^) a positively oriented coordinate system. Then the coor- 
dinate expression of fj^M is given by 

fJ-M = y|det (( 5 ij)i<ij<„)|dx^ A • • • A dx”. 

Proof. If (j/^, • • • , J/”) is another positively oriented coordinate system 
then the number det(^^^ is positive. We denote the metric com- 
ponents with respect to {y ^, . . . , J/”) by gab and obtain 

_ _ dx'^ dx^ 



which in turn implies 




positively oriented coordinate system is globally well defined. At x we can 
choose a positively coordinate system with gab{^) — ±5a6- Calculating 

^det ((Py)i<«,j<n)|da:^ A • • • A dx” 
in this coordinate system implies that this n-form coincides with /iM- i 

Definition 4.2.3. Let (f 6 'Tf(M) be a tensor field on (M^g). Then the 
divergence of ip, div((/?); is a tensor field in and given by 

div(^(A\..,,A^”\ui,...,'ys) 

n 

a=l ^ 

where {Ei^ . . . , En; 0^, . . . 0'^} is (any) pair of dual, orthonormal bases. 

This definition is independent of the chosen orthonormal basis — a fact 
which can be seen best in coordinates: the equation div(f^)^-^^*’^^“^(a;) — 

V {^) is clearly invariant and agrees with the definition above |[p. ire i] 
if the Gaufiian frame is chosen to be orthonormal at x. j p. 137 





180 4. Pseudo- Riemannian manifolds 



Lemma 4*2*4* Let {M^g) be an oriented pseudo-Riemannian manifold 
and U be a vector field on M. Then the formula 

= (div(f/))/LiM 

holds. 

Proof Let x G M and choose normal coordinates centred at x. Since the 
Christoffel symbols of V vanish at x we have 

Va = 9a and da (jde^{i9ab)i<a,b<n)^ = 0- 
Hence using Lemma 2.5.3 we obtain at x 



£u 9’M = £jj ^y^det {{9ab)i<a,b<n)^^^ A • • • A dx”^ 
= • y^det ((5ab)i<a,6<„) j dx^ A • • • A dx" 



+ y det ((§a6)i<„,6<n) £u (d^^^ A • • • A dx”) 



= y det ((5afc) !<„,(,<„) ^ dx^ A • • ■ A £[ 7 dx“ A • • • A dx" 



o~l 

n 



= J det ((5a6)i<a,6<n) ^ A • • • A d(C/ J dx'") A • • • A da?"" 



a=l 



~ y A • • • A dU^ A • • • A da;’^ 



a—l 



/ / \ dU^ 

= Y det [{9ab)i<a,b<n) A • • • A dx” 

a=l 



^ QJJCb 

= S ^ = (div(f/))MM. 






dx' 



Since x was arbitrary we have £jjiim = (div(J7))/i everywhere. 



Corollary 4.2.1 (Theorem of GauB). Let{M,g) be an oriented pseu- 
do-Riemannian manifold, U be a vector field on M, and V C M an open 
subset of M with smooth boundary dV. Assume that dV is a smooth sub- 
manifold and that there is a vector field n along dV with g{v, n) = 0 for 
all V € TdV and ^(n,n) =r] e {-1, 1}. Then we have 

/ div{U)g.M (n, U) pov- 

Jv JdV 
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Proof. Lemmas 4.2.4 and 2.5.3 imply (div(C/))//M = £jj^m = C/Jd/XM + 
d{U J fiM) — d{U J Denote hy o: dV M, x ^ x the canonical 
inclusion. Then we have i*{U J /j>m) — V d fjtM = rj {U,n) 

Hence the Theorem of Stokes (Theorem 2.5.5) implies 

/ div{U)fj,M = / d{U Jfj,M) = r] / {n,U) hqv. 

Jv JV Jdv 



We will now introduce a canonical isomorphism between the space of p- 
forms AP{TxM) and the space of (n— p)-forms See Sect. 2.5.1 

for the motivation of this isomorphism. 

Proposition 4,2.1, Let (M^g) be an oriented pseudo- Riemannian man- 
ifold with volume form fi. For each p ^ {0, . . . , n} there is a unique linear 
isomorphism 

i<: A^{TxM) 

such that g[n-p]{*i’,4') = glnKi’ ^ ^ ^ A^iT^M), cj) e 

The operator -k is called the Hodge star operator and the isomorphism 
induced by k the Hodge star isomorphism. 

Proof Let {ei, . . . , e^}, . . . , be a dual pair of orthonormal bases 

and Tji = {ci, ei). Then the set {o;^^ A * • • A is an or- 

thonormal basis of A^'~'P{TxM) with 

A • • • A = Va^Sa.b^ ■ ■ 

where we have assumed ai < - ■ ■ < an-p, bi < • • • < 6n-p* We can 
therefore write 

*V'= X! A--- Aw""-" 

d\ <C" ' <C. dji, — p 

= X 5[n-p](*^. A • • ■ A 

ai<'“<aT»,_7> 

xrja, A--- Aw“"-" 

= X [° ](^ A A • • • A , /j,) 

d\ <! ■ • ' <^dfi—p 

X A ■ • • A 0)“"-'’. 

The last formula proves that k'lp is uniquely defined by the defining 
property of k. Further, our explicit basis representation shows that k^f) 
exists. Since 4>) = ^?[nK'0 ^ is invariant equation, our 

representation of k'lp does not depend on the chosen basis. I 
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Corollary 4.2.2. Let 'ip e A^{TxM) and {ei, . . . ,6^}, {o;^, . . . ,o;^} be a 
pair of orthonormal dual bases. Writing rfi we have 

0,1 

Proposition 4.2.2, The operator A^{TxM) yl^“^(Ta;M) is an 
isometry for even index i/ and an anti-isometry for odd index u. 

Proof Let {ei, . . . , e^}, {u )^^ . . . , a;^} be a pair of orthonormal dual bases 
and assume ai < • • • < a^j^i < ■ ■ • < an, < • • • < 6p, < • •• < 

• ■ • an} = {^1, . . . , bn} ™ {1, . . . , n}. By Corollary 4.2.2 we have 

-ku)^^ A • • • A =: sign(ai, . . . , an)7/ay,+i - • • A • • • A a;®” 

and therefore 

9[n-p\ A • • • A a;"""’ , A • • • A a;^"' ) 

- sign(ai, . . . , an)sign(6i, . . . , 6n)r?a,,+i * • • • • • Vbr,. 

X y [n-j,] A • • ■ A A • • • A a;^^’- ) 

“ ■ Aa;“"'',a;^^^+^ A • • • A aj^” ) 

” ’ ■ ' Par,.^aj>+ibp+i * ’ • Saj,.b-r,, 5 

where we have used that A - • • A A • • • A 

vanishes unless a^+i ^ 6p4,i, . . . , an = 6n- Since 

A • • • A a;*’! A ■ • ■ A w'’’’) = 77ai ■ ■ • »?a„^ai6i ■ ■ ■ ^apbp 
and (by our selection and ordering of indices) 

^ap+ibp+i ■ ■ * ^ar,bj,. = ^aibi ' ' * ^apbj, 

holds, we have 

9ln-p] A ■ ■ ■ A , ^a;^^ A ■ • • A ) 

= A ■ ■ ■ A A • ■ ■ A 

Vat --Va„ ' 

= (-l)'"fl'[p](w“‘ A • • • A A • • • Aw*’"). 



Lemma 4.2.5. Le^ (M, p) be an oriented pseudo- Riemannian manifold 
of index u with volume form ji. Then 

(i) *1 = fi, *fj. = {-ly; 




4.2 The volume form and the Hodge star operator 183 



(ii) = (- 1 )^" for all (j> € Ap{T^M), € 

A^-P{T^M); 

(in) <f) A ■ki> = g^^{4>, i’) fJ- for all e Ar{TxM); 

(iv) 

Proof (i): By definition we have 5 [“](*!,/*) = 5 [°](lA/i, /u) = 

and the first claim follows since is non-degenerate and A^{TxM) is a 
one dimensional vector space. The other claim can be proved analogously 
but it also follows from (iv). 

(ii) : Using Lemma 2.3.8 we can directly calculate 

9ll-pK*<l>, = 9?n\{^ A^,g) = A 4>, M) 

(iii) : From Proposition 4.2.2 we get 

9[IK<I> A*'tp,fj) = gt-p\i*4>,-><'>P) = 

The assertion follows now from ij) = rji . . .r}n = (“1)^- 

(iv) : For every e A^{TxM) we have 

g[l](**f>,i’) ^ 9t]{*<pAi),n) 

whence the claim follows from the non- degeneracy of g[^]. I 

Lemma 4.2.6. Let (M^g) be an oriented pseudo- Riemannian manifold 
and u) € i?^(M). Then we have div(o;) = (-1)^”^ :^rd^o;. 

Proof. Let x e M and choose a normal coordinate system centered 
at X such that {9i, . . , , 5n} is orthonormal at x. Since the equation to 
be proved is linear in oj we can assume without loss of generality that 
oj — /d.T*^ A * • * A dx^?^ where i\ < • • • < ip. We obtain at x 

★a? = /sign(ii, . . . , in)r}i,>+t • • • A dx''^-. 

Since the first derivative of the metric components at x vanish we can 
ignore them in the calculation oi die lo and get 

n 

(d ■kU))x = 'Y^ djfs\gn{ii , . . . , in)m„+i • • • rji,,dx^ A da;*"+i A • • • A da:*” 
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Let 77€{~l,l}be the number which satisfies d:r-^ A A • • • A — 
7]dx^^^ A • • • A dx^^- where jp < - ' < jn and let cry] be the permuta- 
tion of u, . . . , ip which moves j into the last entry and which leaves the 
relative order of all other entries fixed. Then we get sign(ji, . . . , — 

sign(ii, . . . , ip) resign (<7[j]). We get then 

A A • • • A drr^'" ) 

= ^sign(ii , . . . , jn)Vh ■ ■ • Vj„-i A 

= sign(ii, . . . , i„)sign(<T[j])??jj . . . A drA-i 

= sign(ii, . . . , in)siga{ay])rij ^ . . . 

X dj J ^dx^ A dx** A • • • A dx^ A • • • A dx*’’) 

= (-l)^-^sign(ii, . . . , in)rij, . . . 

X dj J ^dx^ A dx*^ A • - • A dx*’’) . 

Hence we get 

n 

(*d * ^ djf sign{ii, in)Vi„+i • • ■ Vin (-1)’’"^ 

X sign(ii, . . . , in)Vji ■ ■ ■ -■ (dx^ A dx** A • • • A dx*”) 

n 

= J ^dx^ A dx*^ A • ■ ■ A dx*^) 

j=i 

= (-l)P“Miv(w)a;. 
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The material presented in this section up to and including Corollary 
Jf.,3A is fundamental for Sect 5 on relativity. 

The rest of this section can be omitted on first reading. 



The Riemann (or curvature) tensor R{U^ V)W — S/ jj\7 yW ~V yV ijW — 
W]^ ^ connection has been introduced geometrically in Sect. 2,8. 

Since it arises in such a natural way, one would suspect that the Riemann 
tensor of the Levi-Civita connection plays also an important role in the 
physical theory of spacetime. This is indeed so as we will see in Chap. 5. 
Here we will merely investigate some of its mathematical properties. 
The Riemann tensor satisfies a collection of useful algebraic identities. 
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Proposition 4*3.1* Let (M^g) be a pseudo- Riemannian manifold^ x € 
M, andt^u.v^weTxM, Then 

R{u^v) — — 

i?(u, v)w + R{v^ w)u + R{w^ u)v = 0 (First Bianchi Identity)^ 
{R{u^ v)w^ t) = — {R{u^ v)t^ w) j 
{R{u^v)w^t) = {R{w^t)u,v) . 

Proof, The first identity is clear from the definition of the curvature 
tensor. The second identity is just the first Bianchi identity which holds 
for any torsion- free connection (cf. Lemma 2.8.1). 

Since 

{R{u^ v){w + t),w 1) 

— {R{u, v)w, w) + {R{u^ v)w, t) + {R{u, u)t, w) -1- {R{u, v)t^ t ) , 

the third identity is proved once we have shown that {R{u^v)w,w) = 0 
for all vectors u,v^w € TcM. We can assume that there are vector 
fields U, F, W which are linear combinations of Gaufiian vector fields 
with constant coefficients and satisfy Ux = u^Vx = Wx = w. Then we 
have [[/, F] = 0 and calculate 

{R{u,v)w,w) = {^U^VW,W) - (VyV^,W) 

= (Vy W", W) - {VyW, VijW) 

- Vy {WjjW, W) + (V[;W, VyW) 

= V[/ (Vy W) - Vy {VuW, W) 

= • y • (Ty, Ty) - iy • [/ • (w, w) 

= 0 , 

where in the last equality we have used that the vector fields have con- 
stant coefficients. 

The first Bianchi identity implies 
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2 (-) 6 (-) 5 (+) 

+ {R{v^ w)t^ u) + {R{w^ t)v, u) + v)w^ u) 

~ 2 {R{w^ u)v, t) ~\- 2 {R{v, t)u, w ) . 

Here we have used the first and third identity to show that the (-) terms 
cancel and the (+) terms are equal. | 

Recall that by taking the trace we can construct a ( 2 ) tensor Ric from 
the curvature tensor. In the pseudo-Riemannian case we can contract 
Ric with the metric to obtain a function. 

Definition 4.3.1. Let (M, p) be a pseudo-Riemannian manifold. Then 
the function Seal := tr(Ric) = ^^^Ric^b is called the scalar curvature of 
9- 

The second Bianchi identity (Lemma 2.8.1) has the following conse- 
quence which will be of importance in Sect. 5.3. 

Lemma 4.3.1. Let(M^g) be a pseudo-Riemannian manifold. Then 
div(Ric — ^Scal^) = 0. 

Proof. Prom the second Bianchi identity we infer 
0 = 

n 

({V e^R){u,v)w + {S/yR){Ea,u)w + (y^R){v,Ea)w^ 

a—1 

= div(R)(u,u)u; + V.^Ric('U,t(;) - V.^Ric(t;,ti;), 

where we have used the property = 0 of the Levi-Civita connection. 
Replacing v and w with a contraction gives 0 = 2div(Ric)(u) — V.^Scal. 



Proposition 4.3.2. Let (M, g) be a pseudo-Riemannian manifold and 
let . . . ^x'^) be normal coordinates centred at € M. In these coor- 
dinates^ the metric tensor g satisfies 

9ab{s:) = gab{soo) ~ ^Racbd{^o)x''x'^ -f odar""!^). 

Proof Since (a:^ . . . , are normal coordinates centred at xo The Chris- 
toffel symbols satisfy Pijixo) = 0. The Koszul formula (2.7.7) implies 
that 

r-j = + djgu - digij). 
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Hence we obtain 

^idkirij + 9kiri)) = digij + djgu - digij + digij + djgu - digij = 2djgu 

which implies that the first partial derivatives of the metric components 
vanish at xo> It follows immediately that then also all vanish at 
Xq. At Xo we can calculate for the Riemann tensor 

Rld = iR{dc.dM" 

= {^doinddi)-V9^incdi)y 

= - w, 

= (dcdbged + dcddgte - dcd^gbd - dddbgec - dddcgbe + dddegbc'j 

= (dcdbged - dcdcgbd - dddbgec + dddegbc^ ■ ( 4 . 3 . 1 ) 

We will now show that dadhQcd = dcddOab at xq. Recall that in normal 
coordinates centred at xo all rays t — ^ . . . tx'^) which pass through 

a;o are geodesics. Hence we get r^^{tx)x^x^ = 0 for all a; = . . . ^x"^) 

small enough. This implies 

0 = ^{2gaer^ci^^x)x'’x‘') 

= ^ {{dbgac{tx) + dcgba{tx) - dagbc{tx)) xV) 

= t {dddbgac{ix) + dddcgba{tx) - dddagbc{tx)) x'^x^ 

+ 2 {ddgac{ix) + dcgda{tx) - dagdc{tx)) x^. 

Observe that ddgaci^) = 0 and that therefore 

Im jddgac{tx) = d(adS-ac(0))(x) = deddgac{0)x^ ■ 

Hence dividing the equation above by t and taking the limit t — > 0 we 
get 

0 = {dddbgaciO) + dddcgba{0) - dddagbdO)) 

+ 2 (deddgac{0) + dedcgda{0) - dedagdc{Q)) x'^x^. 

= x^x^{idbddgac{0) - dddagbciO) + 2dbdcgda{0) - 2dbdagdc{0)) ■ 

Contracting this equation with x*^ we obtain 

0 = {2dbddgac{0) - dddagbc{0))x^x‘'x‘^ 

Since (x^,...,x”) is arbitrary (for small enough values) this equation 
implies that Gabcd ■= 2d(^ddbg\a\c){^) ~ ^(d9|o|ff6c)(0) = 0. We obtain 
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0 ~ Gabcd d" Ghcad d“ G^abd 

— ^ (29rf^6pac(0) “ dddagbc{^) + ^db^cOadi^) ~ 9h^a9cd{^) 

+ 2dcdd9ah{^) - dcda9db{^)) 

+ ^(25d5c56a(0) - dddbQcai^i) + 2dcda9bd{0) - dcdbgadio) 

+ 2daddgbc{0) - dadbgdc{0)) 

+ -^{2dddagcb{0) - dddcgab{0) + 2dadbgcd{0) - 9a^cfl'6d(0) 

+ 2dbddgca{0) - dbdcgda{0)) 

= ^{^dddbgac(Q) + ^dddagbciO) + 30c9dffob(O)) 

= dddagbcio) + dddbgcaio) + dddcgab{0). (4.3.2) 

This last equation implies now 
0 — Gdbca 

^{2dadbgdc{0) - daddgbciO) + 2dbdcgda(0) - dbddgca{0) 

+ 2dcdagdb(0) - dcddgab{0)) 

= ^{dadbgdcio) + dbdcgda{0) + dcdagdb{0))- 

We interchange b and d in this equation and get daddgbci0)+dddcgba(0)+ 
dcdagbdiO) = 0. A comparison with Equation (4.3.2) gives dcdagbdi^) = 
dddbgcaio) which is equivalent to our assertion dadbgcd = dcddgab- 
It follows that Equation (4.3.1) implies 

^bcd(O) = 5“*(0)(^c56fifed(0) - dcdegbdiO)^ 



and therefore 

RacbdiO) + RadbciO) = dbdcgadio) - dbdagcdio) + dbddgacio) - dbdagdcio) 

= dbdcgadio) + dbddgaciO) - dbdagcdiO) 

= -SdbdagcdiO), 

where we have used Equation (4.3.2). The assertion of the proposition 
follows now from a Taylor expansion of gabix) around the point xq- I 



Corollary 4.3.1. Let A he a tensor field which is pointwise defined as 
an algebraic expression of g and its first two derivatives. Then A is an 
algebraic expression of g and the Riemann tensor R. 
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Proof. Proposition 4.3.2 implies that any tensor field A which is a point- 
wise invariant function of g and its first two derivatives only depends on 
g and R. I 

For 2“dimensional manifolds, the Riemann tensor reduces to a function, a 
much simpler geometric quantity. To see this, let {ei, 62} be an orthonor- 
mal basis of TxM. Since •)•, •) is anti-symmetric in the first two and 
in the second two entries the expression {R{ei^ 62)62, ei) does not depend 
on the chosen orthonormal basis and defines a function R": M M. It 
is easy to see that this function describes the curvature tensor uniquely. 
(For explicit formulas cf. Sect. 4.3.1) 

For higher dimensional pseudo-Riemannian manifolds such a simple 
relationship does not exist. However, it is possible to define a function 
K which maps the space of non-degenerate 2-dimensional subspaces of 
TxM into the real numbers. 

Let 



[p. 184 i] 
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= {span{ua:,Va,} : Ux If t'a;, 5|span{«,,,t)x} is non-degenerate} 

be the set of all two-dimensional subspaces in TM which are either 
spacelike or timelike. 



Definition 4.3.2, The function 
K: ^ R, spanfu^:, 

is called the sectional curvature of M. 



i^RjUx^ Vx)'^X'> ^x) 



Observe that this expression is well defined since the denominator does 
not vanish for any TI e and both the numerator and the 

denominator transform by a factor det(^)^ under a change of basis Ux = 
A-yUx “f A^Vx^ Ux ~ A 2 UX A^Vx' 

Those pseudo-Riemannian manifolds for which the sectional curva- 
ture reduces to a function on M should be especially interesting. 

Proposition 4.3.3. Let (M,^) he a pseudo-Riemannian manifold and 
X e M. If the sectional curvature satisfies K{Ux) = K{flx) for all 
IIx^IIx 6 TxM then there exists a number c E R with Rx{UjV)w ~ 
c{{v,w) u — (u, w) v) for all u^v^w E TxM. 

Proof. Assume that K{IIx) does not depend on the choice of plane in 
TxM. Then the definition of the sectional curvature implies that given 
an orthonormal basis {ei, . . . , e^} we have 

(R(e^, 6j)ejf, = c e^;) {cj^ ef) — (e^, ef) ^ 

for some constant c. From the tensorial property of (R( •,•)*, •) we con- 
clude that 
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{R{u, v)v^ u) —c {{u, u) {v, v) - (tt, 
for all %v e T^M. For every w € we obtain 

0 = {R{u + w, v)v^ u + w) ~ c (j^u + u + 14 ;) (i;, i;} - (ii -j- w, i;)^^ 

= {R{u^v)v^u) 4- {R{w^v)v,u) + {R(u^v)v^w) + {R{w^v)v,w) 

- c( (u, u) {v, i;) + 2 (i4, 14;} (i;, i;) + {w^ w) {v, v) 

- (it, i;)^ - 2 (it, i;) (i4;, i;) - (i;, it;)^ ) 

= 2 (i?(it, i?)i', ix;) + c( (it, It) (i;, u) - (it, v)^ + (ly, w) (i;, i;) — (k;, v)’^ 

- (it, It) (i>, v) -2 (it, Ilf) (i?, 1 ?) - {w, w) {v, v) 

-f (i4, v)^ + 2 (it, i;) (it;, d) + (i;, it;)^ 

2 (i^(it, i;)t^ - c((i;, i;) It - (it, i?) i?), k;) 

which implies i^(it, i;)i; = c((i;, v)u- (u, v) v) for all it, i; G T^M. We can 
now polarise again and get 

0 = i?(it, V -j- ii?)(i; + It;) — c( (i? + It;, i; + lu) It — (it, i; -j- it;) (i> + it;)) 

~ i^(it, v)v — c((i?, v)u — (it, i;) i;) + i^(it, v)w — c((i», it;) it — (it, w) i;) 

+ i^(it, w)v - c((it;, v)u- (i;, it) it;) + R{u, w)w 
~~ c{{w, w)u~ (it;, It) It;) 

= i^(it, i;)it; - c((i;, w)u~ (it, w) v) + R{u, w)v - c((it;, i;) it - (i>, it) w). 

This implies i^(it, v)w - c((i;, it;) it - (it, it;) v) = J?(i 4 ;, it)i; - c((it, i;) it; - 
(it;, i;) It) for all u,v,w e T^M^i.e the expression jR(it, i»)it; - c((i;, w)u- 
(it. It;) u) is invariant with respect to cyclic permutation. Since the cyclic 
R{u,v)w + R{w,u)v 4* R{v,w)u vanishes (cf. Proposition 4.3.1) and 

(it, w)v — (u, w)u^~ (i?, u)w — (it, v)w^ {Wj v)u — (if, It) i; = 0, 

the cyclic sum of i^(it, i;)it; ~ c((i;, w)u- (u, if) ii) gives 0 = 3(jR(it, i;)it; - 
c((u, If) It — (it. If) i;)) . I 

Proposition 4.3.4 (Lemma of Schur). If dim(M) > 3 and the sec- 
tional curvature K : h- > M reduces to a function on M then 

it is constant 

Proof. Proposition 4.3.3 implies that there exists a function x c(x) 
with i^a:(it,i;)if = c(x) ((u, If) It — (it, If) f) for all u^v^w € TxM. Let 
[/, V, W be vector fields whose covariant derivatives vanish at x and which 
satisfy = u, 14 = = if. Then we obtain for any vector t € T^M 

(V^i?) (it, f )if — dc(t) (('f , w)u— (it, w) 1 ?) . 
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Hence the second Bianchi identity (Lemma 2.8.1) implies 

0 = dc{t) ((t>, w)u- (u, w) v) -I- dc(t?) ((u, w)t — (t, w) u) 

+ dc(u) {{t^w) V - {v^w) t ) . 

Let u be a vector which is not lightlike and w ^ 0 he orthogonal to u. 
We can choose t = w and v to be orthogonal to w. Since dim(M) > 3 
we can also assume that u^v^vo are linearly independent. The equation 
above reduces then to 

0 = (— dc(t?)u + dc(tt)u) (u;, w ) . 

which in turn implies dc(u) = 0, Hence dc vanishes on all vectors which 
are not lightlike and therefore must vanish identically. I 

Observe that the preceding proposition is false for two-dimensional pseu- 
do- Riemannian manifolds. 

Proposition 4.3.4 motivates the following definition. 

Definition 4.3.3* A pseudo- Riemannian manifold (M^g) has constant 
curvature if the sectional curvature K : i— > R is constant. 

In Proposition 4.5.2 below we will locally classify all pseudo-Riemannian 
manifolds of constant curvature. 



4.3.1 2-dimensional pseudo-Riemannian manifolds 

We collect formulas for the Levi-Civita connection and the curvature 
tensor of 2- dimensional pseudo-Riemannian manifolds. These formu- 
las will be used in Sect. 7.J^. 

2-dimensional pseudo-Riemannian manifolds are the lowest dimensional 
pseudo-Riemannian manifolds which are not trivial 

Let € {—1, 1} and let {^ 1 ,^ 2 } be an orthonormal frame with 

g{Ex,Ex) = ??i, giE^E^) = 0, g{Ex,Ex) = r\ 2 ^ By Corollary 2.4.2 
there are coordinates (t,g) and functions A(t,g), vif^g) such that Ei = 
e-^(^w)ae and E 2 = 

Proposition 4.3.5. Let (M, be a 2-dimensional pseudo-Riemannian 
manifold. Then [^ 1 ,^ 2 ] = diy{E 2 )Ei - dA(Ei )£?2 and the Levi-Civita 
connection is given by 

V ^^Ei = -riiri2di'{E2)E2, V — diy{E2)Ei, 

V E2^'^ ~ dX{Ei)E2y V = -T]iri2d\{Ei)Ei. 

The curvature is completely determined by the scalar curvature Seal, 
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R{Ei,E2)Ei = -^ScalE2, R{EuE2)E2 - ^Scalf^i, 
z z 

Ric = Iscalp, K{TxM) = ^Scal, 
z z 

where 

Seal = — 2^?7 i (^Ei • Ei • X -{■ (dA(£^i))^) 4- ^2 {^2 • -£ 2 ^ + (^^'(-^ 2 ))^) ^ . 

Proof. We calculate first the commutator of the vector field Ei, £ 2 ^ using 
the ordinary derivative of with respect to the coordinates (t^q). 

[Ei,E 2 ] = D{e-^d,){e-''dt) - D{e-‘'dt){e-^d,) 

= -e~^e-^dX{dt)dg + e-‘'e-^Ddg{dt)+e-‘'e-^du(dg)dt 
- e-'e-^Ddt{dg) 

=0 

= -dX{Ei)E2 + diy{E2)Ei - e-'^e"^ [i^. 

Our formulas for the Levi-Civita connection follow directly from 

~ 2^'>- = 

(y £^2'®^’ “ 2'^^ *^1 = 0) 

( y £1-^2, £2^ = • % = 0, 

E 2 ^‘^’ -^2^ = ^£'2 • % = 0, 

(y £ i £ 2 >£ i ^ = (y ~ ^ 

= {di/{E2)Ei - dA(£i)£2,£i) = rndu{E2), 

^V£ j £ i ,£ 2 ^ - - ( Eu '^ Ei^^'X ~ -%dt/(£ 2 ), 

(v e^E'I^E':^ = - ^V£;j£2 - V £ 2 ^ 

= — (dz^(E2)Ei — dA(Ei)E2,E2) = r^2dA(Ei)E2 

= - <^E2, = “^2dA(Ei) 

The curvature can now be calculated straightforwardly. 

R{Ei,E2)Ei = - '^E2^Ei^'^ “^[Ei,E2]^1 

“ V _g^(dA(Ei)E2) + ?7 i?72V j^^{di'{E2)E2) 

- dz/(E2)V + AX{Ei)V ^^Ei 
= El • El • A E 2 + dA(Ei )dz^(E2)Ei ?7i^2E2 • E 2 • iz E 2 
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— diy(E2)dX[Ei)Ei 4* 7]iT]2{du(^E2))‘^ E2 + (dA(£/i))^J5/2 

= (^Ei»Ei»\ + {dX{Ei))^ 

+ ^1^72 (-®2 • E 2 * V -\- (dt'(£/2))^) ^■®2- 

Since we have 



Ric(£;i,£;i) - tx{R{;Ex)Ex) - 6\R{E2,Ei)Ex) 

= V2 {R(E2, Ei)Ei, E 2 ) — -rj2 {R(Ei, E2)Ei,E2 ) , 
Ric(£^i,£/2) = R-ic(-E'2) £^i) ='^^{E{')E2)Ei) 

= e\R{E2,E2)Ei) + 0\R{EuE2)Ei) = 0, 

Ric(jE*2,£?2) “ tv{R{-^E2)E2) = rji {R{Ei^E2)E2^Ei) 

= -rji (R{Ei,E2)Ei,E2 ) , 



we obtain Ric = |Scal^, where 

Seal = —2r}irj2 {R{Ei^ E2)Ei^ E2) 

= - 2(771 {Ei»Ei»\ + {d\{Ei)f) 

+ Tj2 (•£'2 • E2 • t* + (dt'(£72))^) ^ • 



Finally, the sectional curvature is given by 

{R{Ei,E2)Ei,E2) 



K{T^M) = - 



(Ei,Ei) (E2,E2) - {Ei,E2) 



2 = 



4.4 Submanifolds 

Given a pseudo-Riemannian manifold (M^g) and an (immersed) sub- 
manifold /: — 5^ M it is of interest which geometrical structure E 

inherits from (M,p). Examples for physically especially interesting sub- 
manifolds are spacelike hypersurfaces (describing an instant of time, 
cf. Sect. 5.4) or the integrated light cone. In Chap. 9 closed trapped 
surfaces^ a class of submanifolds of codimension 2, will play a central 
role. 

Remark 4-4^ R l^his section we will explicitly refer to the immersion 
/. Very often the immersed submanifold is a subset of M and / is the 
canonical injection. In this case it is more convenient to omit any refer- 
ences to /. We will often speak of tensor fields along E instead of tensor 
fields along /. 
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We denote the space of vector fields along S by and will use the 

notation introduced following Lemma 2.9.2. In this notation, Lemma 
2.9.1 is fundamental for the following. 

Lemma 4.4*1. Let (M,p) be a pseudo-Riemannian manifold and 
f\S-^Mbean immersed submanifold of M. Then the following holds 
for all vector fields U^V e Tq{S), X^Y e ^^(/). 

(i) V f^uf.V - V f^vf*U = MU, 

(ii) d(x,y) ([/) = + (x, V/,f/F). 

Proof These properties follow immediately from the definition of V j? jjX 
(cf. Lemma 2,9.1) and the fact that V is a Levi-Civita connection. I 

Definition 4.4.1. Let f \ S M be an immersed pseudo-Riemannian 
manifold and g be the metric of M, 

The map f (or the immersed submanifold E) is called non-degenerate 
^ff*9 a non-degenerate -tensor field. A non-degenerate submanifold 
is also called a pseudo-Riemannian submanifold, and a non- degenerate 
hypersurface is also called a pseudo-Riemannian hypersurface. 

Let E be a non-degenerate immersed submanifold of (M, g) and de- 
note by (T^E)-^ the set of all v e Tf(^^^M with g{v,w) = 0 for all 
w € fYTxE. The decomposition Tf^^^M - f^T^E 0 (T^j^E)-^ induces 
projections 

^ f^TxE and v ^ {TxE)-^ 
such that V = v~^ -\- for every t? 6 

Lemma 4.4.2. Let E be a non- degenerate immersed submanifold of 
(M,g) and U,V e Tq^{E). Then X^jjV defined by 

is the Levi-Civita connection of{E^f*g). 

Proof Observe that X^jjV is well defined since / is an immersion. Proper- 
ties (i)-(iv) in Lemma 2.9.1 imply that the Koszul equation (cf. Equation 
2.7.7 in Theorem 2.7.1) is satisfied for V. I 

Lemma 4.4.3. Let E be a non- degenerate immersed submanifold of 
{M^g). The map 

{U,V)^M{U,V) := (7f^uf*yf 

is symmetric in U and V and function- linear. I is called the shape 
tensor. 
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Proof, The map E is clearly function-linear in its first entry. Recall that 
the Lie bracket of any two vector fields tangent to E is itself tangent to E. 

Hence M{U, V) - E{V, U) = = (/*[[/, 1 /])^ - 

0 , which in turn implies that E is symmetric. But then E must also be 
function-linear in its second entry. I 

So far we have considered vector fields tangent to E, We obtain similar 
relationships for vector fields normal to E, 

Lemma 4.4.4. Let E be a non-degenerate immersed submanifold of 
Let U be a vector field on E and N be a vector field along E 
such that N{x) 6 {Tf(^^)f^E)-^ for all x £ E. Then V j^^yN is well 
defined and we have 

= (v/^yiv)^ - (J(y, . 

where H denotes the lift of indices with respect to the induced metric f*g. 

Proof It follows immediately from the coordinate expression that V f^y^ 
does not depend on the extensions of V, N off E. Hence it is well defined. 
Let X be a vector field along E and its tangent and normal 

part. Then we have 

(v_,.vW.x) = {Vf^yN.X^) + 

=0 

= - (jV, 

= ((Vy.yw)'" - (w.ircv'.xT)) . 

I 

Lemma 4.4.5. Let E be a non- degenerate immersed submanifold of 
(M^g) and let [a^b] E be a smooth curve. For every vector n e 
there is a unique vector field N along / 07 with 

(i) N{a) = Uf 

(ii) N{t) £ {f^T^{t)E)^ for all t £ [a,b], 

(in) and (V = 0. 
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We will write N(t) = and refer to it as the normal parallel 

transport of n. 

Proof, Let . . . j£'n-dim(r) be orthonormal vector fields along / o 7 
which span at every point. We can decompose any vector field 

N along /07 with values in (TS)-^ as N{t) = N^{t)Ei(t). Similarly, there 
are functions : [a, b]-^R such that (V “ F^Ei, With respect 

to these decompositions we obtain (V — {^N'^-\-N^F^)Ei. Hence 
the equation (V ~ 0 reduces to a first order system of ordinary 
differential equations and each solution is uniquely determined by its 
initial values N^{a) =n\ | 



Proposition 4,4.1 (Gaufi equation). Let E be an immersed non-de- 
generate submanifold of (M^g), Denote by the Riemann tensor of 
(E,g) and let U, V^W^X e Tq{E), Then we have 

rg{^R{U,V)W,X) 

= {R(U,V)W,X) + {I{U,X),I{V,W)) - {IiU,W),I(V,X)). 

Proof Since this is a tensor equation we can assume that [C/, V] = 0 . We 
calculate 

{R{f.UJ.V)UWJ.X) 

= (^/.£//* + (Xf^uI{V,W),UX) 

- (v/^y/. i'^uW)J*x) - (Vf^ymw),f.x) 

= if* (VjjVvW)j.X) - (A {VvVuW),f,X) 

=0 

+ V f^jj'{][{v,w),ux) - ( J(y, W), V f^uf*x) 

=0 

- V f^vimw)j.x) + ( J(C/, Ty), V f^vf*x) 

= (/* {^R{U, V)W) , /*x) - (J(y W), I{U, X)) 

+ {M{U,W),I{V,X)). 



Since E is neither a tensor field on E nor a tensor field of M, the covariant 
derivative of E is not defined a priori. However, it is easy to check that 
for any vectors u^v^w e T^E and any vector fields C/, V, W with Ux = 
u,Vx = u, Wx — w the expression 
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(v f^w^) {U, V) = V f^wim I")) - 1( V^t/, V) - I{U, v^l/) 

depends only on the values of u, v, w. This justifies to call V J as defined 
above the covariant derivative of I. 

Proposition 4.4.2 (Codazzi Equation). Let S be a non-degenerate 
immersed submanifold of (M,g) and let U,V,W € T^iS). Then we have 

{R(mf.V)UW)^ = (Vf^uEy{V,W) - 
Proof. Let iV be a vector field along S which is orthogonal to TS. 
{R{UU,UV)f^W,N) 

= yf.U^f.vf*W,N) - (Vf^yWf^uf*W,N) 
~yMU,V]f*W,N) 

= i^v^) .^) + yf^u^{V,W),N) 

- ^f.vf* i'^uW) ,n) - {Vf^yI{V,W),N) 

- {][{[U,V],W),N) 

= ( J(C7, VyW), N) + (V W), n) 

- {M(V, VuW),N) - (V f^yW, fF), n) 
-{l{VuV,W),N)-\-{l{VyU,W),N) 

= ( (v (^. W),n)-((w f^yl) {U, W),n). 

The assertion follows since N was an arbitrary vector field with values 

in (TS)^. I 

For some purposes the shape tensor is too complex and also too rich in 
information. In order to obtain a simpler geometrical quantity one can 
average at a given point x e U the shape tensor over all direction in the 
submanifold. 

Consider a fc- dimensional Riemannian submanifold E c M, We 
can then identify the set of directions in TxE with the unit sphere 
gk~i = ^ XxE : if*g)x{v,v) = 1}. This set is a compact Riemannian 

submanifold of the Euclidean space {T^E^ if^9)x)^ We denote by 
the volume form of considered as submanifold of {T^E^ {f*g)x)’ A 
good definition for the average of E over the directions in TxE is then 
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where the vector space valued integral is defined as in Remark 2.5.2. This 
method would not work for pseudo-Riemannian manifolds which are not 
Riemannian since in this case the set of directions does not correspond 
to a compact pseudo-Riemannian submanifold of TxE, 

We will now calculate the average in the Riemannian case. This leads 
to a formula which can be straightforwardly generalised to arbitrary 
pseudo-Riemannian submanifolds. For simplicity we only consider a 3~ 
dimensional submanifold U C M. The general case is analogous but 
calculationally more elaborate. Recall from Example 4.0.1 that we can 
parameterise a dense open subset of using the chart map ip given by 

( cos 4> cos 
sin (j) cos 0 j , 
sin^ ) 

where is identified with via an orthonormal basis {ci, C2, 63}. De- 
noting the induced metric on 5^ by ^52 we have {gs^)d6 ~ [9s^){de, do) ~ 
Ij {9s'^)Bip ~ O5 {9s^)(t>4> — cos^^. Hence Lemma 4.2.3 implies — 

I cos ^|d^ A d(;6. Let 64 , . . . , e„ be an orthonormal basis of (Ta^Z)-^. There 
are bilinear forms P with 

n 

Ix{v,w) = y^^M\v,w)ei 
i=4 



for dll v^w e TxM, Since the volume of 5^ is given by ~ |7 t we 

obtain for the average of ffx 



47T 










^ n p7v/2 1 ^ 2 -k f /cos4>cos0\ /coS (/> COS A \ 

— — ( sinc^cos^ I , sivLipcosO ] ] cos(6)d(pd0 Si 

^ / V sin^ )) 

2 n p7t/2 p2-k . 

— — J j cos^ (j) cos^ 0M\i -h 2 cos ^ sin cp cos^ 0F\2 

-f 2 cos (p cos 0 sin 0J[\^ -f sin^ (p cos^ 011^22 

-{- 2 sin ^ cos 0 sin 01 2 ^ -{- sin^ ^-^ 33 ) cos{0)d<pd0 ei 



47T 




cos^ 0'kE\i -f cos^ ^7T J22 + 2 cos <9 sin^ Onll^) cos OdO 6i 



= + 1*2 + 43)) ei 

i— 4 
3 

= ^ ^ / 9{^j) Cj). 

j=i 
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This justifies the following definition. 

Definition 4.4,2. Let f:S~^M be a non-degenerate immersed sub- 
manifold of the pseudo- Riemannian manifold (M, g) , The mean curva- 
ture vector field H is defined by 

^ dim(r) 

~ dim (17) ^ ^ ^ 

where {ei, . . . ,edim(i;)} orthonormal basis ofT^M, 

It is easy to see that does not depend on the choice of orthonormal 
basis. The mean curvature vector field plays a prominent role in the 
investigation of black holes (cf. Chap. 9) and is closely linked to the 
theory of minimal surfaces (cf. Lemma 4.4.8 below). The normalisation 
factor is more common in the mathematical literature than the 

more logical alternative 1. Since unusual normalisations are even worse 
then bad normalisations we have retained this factor. 

Definition 4.4.3. Let S be a non- degenerate immersed submanifold of 
(M,g). A (local) vector field n: E TE-^ along E with (n, n) = ±1 is 
called a normal vector field or simply a normal 

The shape operator Sn : TxE TxE of E associated with n is defined 
by {Snu,v) (ff{u,v),n). 

Denote by tTu : {TxE)^ — ^ Mn the orthogonal projection. Then the sec- 
ond fundamental form kn corresponding to n is defined by 7Tn(^(u,u)) — 
k{u,v)n. 

Definition 4.4.4. Let E be a non- degenerate immersed hypersurface 
of (M,g). The shape operator and second fundamental form of a non- 
degenerate hypersurface are simply denoted by S and k. 

The term second fundamental form comes from the theory of hypersur- 
faces in Euclidean space (R^, (•, -)j^ 3 ) which predates Riemannian ge- 
ometry. The normal space of a non-degenerate hypersurface E is 1- 
dimensionah Therefore there exists an (up to sign) unique normal vector 
field n: E TM along E. Hence even without choosing a normal, k 
and S are uniquely determined up to sign. In Euclidean geometry, the 
induced metric g = /* (v)a ^3 is called first fundamental form since it 
allows to determine fundamental geometrical quantities such as angles 
and lengths. The second fundamental form k is another ( 2 ) -tensor field. 
Since the GauB equation reduces to 

(^R{u^v)w^t) = k{u,t)k{v^w) - k{u^w)k(v,t), 

the curvature of the surface E is completely determined by k. Moreover, 
k also determines the shape tensor and therefore how a hypersurface is 
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curved in space. Hence k is indeed a geometrically fundamental quantity 
which justifies the name second fundamental form. 

Lemma 4.4*6. Let U be a non- degenerate hypersurface with normal n. 
Then the shape operator is given by Su = —V and the second fun- 
damental form by k(u,v) ~ ~ (n,n). The shape operator 

S is self-adjoint and the second fundamental form k is symmetric. 

Proof. Let V be a vector field with = v. Then we have {Su^ f^v) = 

The normalisation 

(n,n) = ±1 

implies (v = 0 and therefore the first assertion. For the second 
fundamental form we calculate 

k{u,v) = {][{u,v),n) (n,n) = (n,n). 

The self-adjointness of S and the symmetry of k follow from ][{u,v) ~ 
I{v,u). I 

In analogy to the mean curvature vector field one can introduce the 
average of the shape operator. 

Definition 4.4.5. Let f\S~^M be a non- degenerate immersed hyper- 
surface. Then the mean curvature is given by H{x) = ^^tr(/Cx). 

The most important mathematical application of this concept arises in 
the theory of minimal submanifolds. 

Lemma 4.4.7. Let {M^g) be an oriented pseudo- Riemannian manifold 
and E C M a pseudo- Riemannian hypersurface in M with normal n. 
Then the volume form of E is given by 

= nJ Pm 

(which implicitly defines an orientation of E). 

Proof. It follows directly from Definition 4.2.1 that either n J pM or 
~nJ Pm is the volume form of E. I 



The rest of this section is an illustration of the concept of mean cur- 
vature primarily directed to mathematicians. 
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Let (M, g) be an oriented, Riemannian manifold and consider a compact 
submanifold B C M of codimension 2. We denote the space of all smooth 
hypersurfaces B C M whose boundary is B by dJfl{B). Then the question 
naturally arises which B e M{B) has minimal volume yoI{B) = 

In general, there may not be a minimising hypersurface or it may not be 
unique. Nevertheless, it is relatively easy to derive a necessary condition 
any minimising hypersurface must satisfy. 

Assume that B is a minimising hypersurface and let {/ be a vector 
field such that U(x) = h(x)n(x) for some function h: ^ E. If h{x) = 0 

for all a: G then the flow Ft of U generates a smooth l-parameter family 
Bt = Ft{B) of hypersurfaces in Tl{B), Hence a necessary condition that 
B has minimal volume is given by = 0- 

Lemma 4.4.8. Let (M, g) be an oriented pseudo- Riemannian manifold 
and B be an oriented non- degenerate hypersurface with normal n. Let n 
be a vector field in a neighbourhood of B which satisfies 

(i) n(x) — ii(x) for all x e B and 

(ii) (it, n) “ ±1 

and h be a function and U = hn. For the flow Ft of U we have 

) / - (n - 1) / TjshHfl;^, 

Jt^oJFtiS) Je 




where H is the mean curvature of B and = (n, n) G {~1, 1}. 

Proof. Since all objects are smooth we can interchange differentiation 
with respect to t and integration over B and obtain, using Lemma 2.5.2, 



s) / ' 

y t=o J Ft(s) 






dtj 



[ Frills) =[ (^) Ft*u.E 

,=odE 

= £jj{nJiJ,M) 

= j {[hn, n] J/iM + div(/in)n J hm) 

= J (-d/i(n) + div(/in))n J(Um = J hdiv{h)ns- 



Let {£^ 1 , . . . E„} be an orthonormal frame with En = h and 0^,. . . ,0^ 
be the dual frame. Then we get 



div(n) = 0“(V£;^n) = 2 e^^iVEh) + ^ 



a=l 



a=l 




202 4. Pseudo- Riemannian manifolds 



n—l 



=0 






a=l 



= (n - 1) (n, n) H. 



Since for each function h with = 0 we obtain a 1-parameter family of 
hypersurfaces St € Tl{B) with Sq = S a, smooth hypersurface can only 
extremise volume if H = 0. For if there would be a point x e S \ { 5 } 
with H{x) 7^ 0 then there would exist a neighbourhood of x in S and 
a function h with 

(i) h{y) — 0 for all y e B U {S\ Ux)^ 

(ii) hly)H(y) > 0 for all y G ^ U S^ 

(hi) h{x)H{x) > 0. 

This would imply rjzihHfjiz: 7^ 0 in contradiction to the extremality of 
vol(Z'o)- The equation H — 0 can be understood as a differential equation 
for a function / which describes the hypersurface It is a classical 
problem in differential geometry to solve this equation. A comprehensive 
monograph on this subject (for M = R^) is (Nitsche 1975). 

4.4.1 Hyperquadrics 

In this section we study the simplest non-trivial class of hypersurfaces 
of . These examples will he used in Chaps. 7 and 6. 



Let 

u n 

?7i, : M” X M” — > E, (x,y) ^ x*^y*'. 

j—1 

be the standard pseudo-scalar product of index 1 /. 

Proposition 4.4.3. The pseudo-Riemannian manifold (R^ \ {0}, r^j/) is 
foliated by hypersurfaces Quad^~^(c) := {x e \ {0} : rjjy{x^x) — c}, 
where c G R. The hypersurface Quad]^“^(c) is non- degenerate if and only 
ifc^O. 

These hypersurfaces Quad]}”^(c) are called hyperquadrics. 

Proof, It is clear that every x e R^ \ {0} lies in exactly one subset 
Quad^^“^(c). The set Quad^“^(c) is the zero set of a: 1—^ fc{x) — x) — 
c. Since df = x^dx^ does not vanish unless 

a: == 0 the map / has constant rank in {x G R’^ : x ^ 0 }. Hence 
Proposition 2.1.1 implies that QuadJ^“^(c) is a hypersurface of R’^ \ {0}. 
The tangent space at x is given by {u G R’^ : rjfy{x,v) = 0}. 
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Assume that c = 0. Then we have riu{x,x) = 0 and therefore x € 
Ta;Quad”“^(c). Since for any tangent vector v we have r]y{x,v) = 0 the 
induced metric is degenerate. 

Assume now that c ^ 0. Suppose there was a vector w e T^Quad”"^ (c) 
with r)y{w, u) = 0 for all v 6 Ta;Quad”“^(c). Since we have also r)y{x, w) = 
0, this would imply r]y{w,y) = 0 for all j/ € R”. This is impossible since 
riy is non-degenerate. | 



Lemma 4.4.9. Let c ^ 0. The map 

l: Quad”~^(c) ^ Quad”li(-c), 
(ar\...,a;‘',a;'"+\...,a;”) ^ . . . .x"") 

is an anti-isometry. 

Proof. The hypersurfaces Quad”~^(c) and Quad”l^(-c) coincide since 
rjy{x,x) = -r}n-yit{x),L{x)) 

for all X 6 The map t is an anti-isometry since for each pair of 
vectors u,v £ Ta;Quad"“^(c) = Ta;Quad^li(-c) we have r]y{v,w) = 
-r]n-yii'*v,i»w). I 

Lemma 4.4.9 imphes that it is possible to restrict attention to those 
hypersurfaces Quad”“^(c) with c > 0. 

Definition 4.4.6. The pseudo-sphere of dimension n, index u, and ra- 
dius r is the hyperquadric = Quad”“^(r^). If u = 0 we unite 

instead o/5o“^(r) and S'^~^ instead o/5”“^(l). 

Lemma 4.4.10. The pseudo- sphere SJ^~^(r) is diffeomorphic to R'^ x 
gk fjgjiQigg ihg k-dimensional unit sphere. 

Proof. The diffeomorphism : R” ^ R", x i-> maps Sy~^(r) onto 
Sy~^{l). It is therefore sufficient to prove the lemma for r = 1. Con- 
sider the map f:Wx (y^z) {y,^/tT^ z). The 

equation Vu{f(,y,z),f{y,z)) = 1 implies that the image of / is a sub- 
set of 5”“^(1). The map / has also a smooth inverse map, f~^{y,z) = 

{Vt \/l + \y\^ z), hence it is in fact a diffeomorphism onto I 

Lemma 4.4.11. Let c ^ 0. The hyperquadric Quad”“^(c) is a manifold 
of constant curvature 1 /c. 




204 4. Pseudo-Riemannian manifolds 



Proof, The shape operator is given by 



Sv = -D 




{v) = 




whence the shape tensor reads 



F{u,v) = -r]^ 




C X 



and the GauB equation (Proposition 4.4.1) reduces to 

r]^{R{u,v)v,u) = {■nu(u,u)r]u{v,v) -■nu{u,v)^) . 



4.4,2 Umbilic and totally geodesic submanifolds 

The investigation of submanifolds is a classical field in differential 
geometry. Naturally, one concentrates on submanifolds whose shape 
tensor is of especially simple form since only for these classes one has 
a chance of at least a partial classification. In this section we collect 
those elementary definitions and results which should be covered in a 
course of differential geometry. 

Readers who are primarily interested in physical aspects can skip 
this section. 

Definition 4.4.7. Let f : E ^ M be an immersed pseudo- Riemannian 
submanifold of a pseudo- Riemannian manifold 

A point X €. E is called umbilic if there exists a vector n £TxM with 
][{u,v) = f*9{u,v)n for all u,v £ T^E, The submanifold E is called 
(totally) umbilic if all points in M are umbilic. 

The submanifold E is called totally geodesic if ff — 0, 

Lemma 4.4.12. Let f:E^M be an immersed non- degenerate sub- 
manifold of the pseudo- Riemannian manifold (M, g) , Then the following 
statements are equivalent. 

(i) E is totally geodesic; 

(a) For every curve 'y C E we have: j is a geodesic of (E^f*g) if 
and only if f oj is a geodesic with respect to (M,g), 

(Hi) For every v £ T^E we have: Ifj is the maximal geodesic in M 
with 7(0) = f^v then there is a 5 > 0 such that 7([— 5>^]) C f{E) 
(iv) For every curve 7: [a, 6] E and every v £ Ty[a]-S’ we have: 
The parallel transport of v along 7 satisfies f^lP^{v) = P^ ^ ^(/*?^)* 
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Proof, (i) (ii): Recall that ^ + Clearly, 

J = 0 implies that the curve 7 is a geodesic if and only if / o 7 is. 
Conversely, from the symmetry of S we infer that 

j = 0 ff{v,v) ~ 0 Vu. 

The assertion follows from the decomposition above since for each v € 
TxS there is a geodesic 7 with 7(0) = v. 

(ii) (iii): Trivial. 

(i) (iv): Let 7 be a curve and v € The parallel transport 

P^(u) of V along 7 satisfies 

0 = = Vf^A^UP^iv) - I{j,Pj{v)). 

Hence the equivalence “(i) ^ (iv)” follows in the same way as “(i) <!=!> 
(ii)”. ■ 

The following lemma shows that totally geodesic submanifolds are char- 
acterised by the infinitesimal neighbourhood of a single point. 

Proposition 4.4.4. Let f:E~^M and f:E-^Mhe two totally 
geodesic submanifolds of the pseudo- Riemannian manifold {M^g), If there 
are points x e S, x e S with f^TxS =J^Tx^ then there are neighbour- 
hoods U of X andU of x with f{U) — f(U). 

Proof, Since exp^ is a local diffeomorphism near Ox € TxLJ there are 
neighbourhoods W, >V of a:, ^ which are swept out by geodesics through 
X, X. The image of these geodesics under /, / are geodesics of M by 
Lemma 4.4.12. Since the tangent vectors of these two sets of geodesics 
at f{x) — f{x) each form a neighbourhood of 0 6 f^TxLJ — f^Txf, there 
is a neighbourhood U of f{x) — f{x) with U Pt /(W) —UC\ /(VV). I 



4.4.3 Warped products 

Many standard spacetimes have a generalised product structure^ the 
warped product structure. To write down curvature expressions for 
this class in general will save work in Chaps, 6 and 8, 

Definition 4.4.8. Let{E,g^), {F^gr) be pseudo- Riemannian manifolds 
of dimensions np andr: E {0} be a smooth function. Then 

the pseudo- Riemannian manifold 

X F,tte*9s + (»■ o , 

where fe: S x F ^ S and ttf - S x F F) are the canonical projec- 
tions, is called the warped product of E and F with warping function 
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We can identify vector fields X on S (respectively, V on F) with the 
vector field X satisfying ~ X , kf^X — 0 (respectively with V 

satisfying ~ 0, v.j, *'/ — V). vVe caii X (respectively, V) the lift 

of X (respectively, F). In the following, we will make use of this iden- 
tification and denote both vector fields by the same letter. Further, for 
any vector field ^ on E x F there are unique vector fields X on T and 
F on F with ^^X + V. 

For every x e M we denote the submanifold E x {'Kf{x)} oi E x F 
by Ex and the submanifold x F by F*. 

Lemma 4.4.13. Let X,Y be vector fields on E x F which are lifts of 
vector fields on E and U, F vector fields on E x F which are lifts of 
vector fields on F. Then 

(i) = V^^^YTTs*X, 

(a) itp^X/yX = 0 , 

(in) Vj^f/ = = d(lnr)(X){7, 

(iv) ■ns*VijV = - (C/, F) grad(lnr), 

(v) = ^t:f^U'^f*V. 

Proof These equations can be verified using the Koszul formula (Equa- 
tion (2.7.7)). 

(i), (ii): Since ff- Ex E is &,n isometry, we only have to show 
that (V;^y, F) = 0 for all vector fields V which are tangent to the fibre 
F. From Proposition 2.4.4 we get {ff)*[X,V] = 0 and ( 7 rF)*[X,F] = 0 
since {fs)*V 0 and {nF)*X = 0. The Koszul formula (2.7.7) reduces 
therefore to 2(Vj^F,F) = -V (X,Y) - (F,[X,F]). Since {X,Y) is a 
function on E the first summand vanishes. The second summand van- 
ishes since [X,Y] is tangent to 

(iii) : Prom [X^ ?7] — 0 we get := ^X^' covariant derivative 

is tangent to the fibres since (>", == - (V;^y,C/) = 0 

for all vector fields Y which are lifts of vector fields on E. The Koszul 
formula implies 

2 {VxU, V)=X {U, V) = X{r'^gF{U, F)) = 2rdr{X)gF{U, V) 

= fdr(;f) {U,V). 

(iv) : The equation follows from 



(V[/F,X> = - {YXjjX) = - (^F, ldr{X)U^ 

= -l{U,V){gmd(r),X). 

(v): For each x e M the fibre Ft is a submanifold of M whose induced 
metric g\F is a constant multiple of the metric on F, = r'^{x)gF- 
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Hence their Levi-Civita connections coincide and the assertion follows. 



Corollary 4.4.1. Let 7 = ( 71 ;, 7f) ^ curve in S x F. A curve 7 is 

a geodesic if and only if 

(i) = (7F,7F)grad(lnr), 

(ii) V^^7 f = -2d(lnr)(7£)7F- 

Proof. We can write j{t) - X{j{t)) + V{y{t)), where X,V are vector 
fields such that X (7(t) is tangent to S^^t) and V {jit)) is tangent to F^^t) 
at all t. Then we have V^7 = + X + VyX + V^V and the 

assertion follows if we project this vector to TS^ and TFx- i 

Lemma 4.4.14. Let X, Y, Z be vector fields on E x F which are lifts of 
vector fields on S and U, V, W be vector fields on E x F which are lifts 
of vector fields on F. Then 

(i) ns*R{X,Y)Z = Rs{Trs*X,ns*Y)'Ks*X, 

(ii) -kf»R{X,Y)Z = 0, 

(iii) R{X,Y)U = 0, 

(iv) R(X,U)Y = ^VXr{X,Y)U, 

(v) R{X, U)V = -i {U,V) V;^grad(r), 

(vi) R{U,V)X = 0, 

(vii) 7rs^R{U,V)W = 0, 

(viii) nF*R{U,V)W = Rf{U,V)W 

(grad(r),grad(r)) {{U, W)V - (V, W) U). 

Proof Assertions (i) and (ii) follow directly from Lemma 4.4.13 (i) and 
(ii). 

(iii) : We may choose X, Y such that [AT, T] = 0. Then 

R{X,Y)U = VxVyC/ - 

= (dlnr(y)17) - Vy (dlnr(A)C/) 

= (VVln(r)(X,y) -dlnr(Vx>") +dln(r)(F)dln(r)(X)) U 
- (VVln(r)(r, A) - dlnr(VyA)+ 
dln(r)(A)dln(r)(Y))!7 
= dlnr([A,y])[/ = 0. 

(iv) : Since [X, 17] = 0 we have 

r{x,u)y = Xx^uY-^u^xY 

= Vx (dlnr(F)17) - dlnr(Vx^)t^ 
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= VV In r{X, F)C/ + d In r(V xYp 
+ dlnr(F)V;fC/ - dlnr(V^y)!7 
= (VVlnr(X, F) + d In r(X)d In r(F)) U 

= ivVr(X,F)t/. 

(vi): We can directly calculate 



•^(^1 — V[/V^X — VyVijX - V\^^y^X 

= V[/(dlnr(X)F) — V^(dlnr(X)17) — dlnr(X)[{7, V] 

2 1 

= VV(lnr(X))(C/,F) +'dlnr(i)Vc/F - VV(lnr(X))(F, [/) 
2 2 

- d In r (i) Vy 17 - d In r{X) [U, V] 

= 0 



where we have used that [17, V] = VjyF — XyU. 

(v): Since {R{X, U)V, W) = - {R{V, W)X, U) = 0, the vector 

R{X, U)V 

must be tangent to Ex- The assertion follows from 

{R{X, U)V, Y) = - {R(X, U)Y, V) = -i VVr(X, F) {U, V) . 

(vii): This follows from {R{U, V)W, X) = - {R(U, V)X, W) = 0. 
(viii) Observe first that the Levi-Civita connection induced on the fi- 
bre Fx equals the Levi-Civita connection of gp since both metrics diff er 
only by a constant factor r'^{x). The result follows from the Gaufi equa- 
tion (Proposition 4.4.1) since by Lemma 4.4.13 (iv) the shape tensor is 
given by E{U, V) = - {U, V) grad(lnr). I 



Lemma 4.4.15. Let X,Y be vector fields on E x F which are lifts of 
vector fields on E and U, V be vector fields on E x F which are lifts of 
vector fields on F. Then 

(i) Ric(X,F) = mcs{'ns^X,'nE*Y) - 2^VVr(X,F), 

(ii) Ric(X, U) = 0, 

(vii) Ric(C/,F) = Ricp{U,V) - (grad(r),grad(r)>) 

x{U,V), 

(iv) Seal = Seals + ^^Scalp - ^Ar - (grad(r), grad(r)) . 
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Proof. Since Ric{X^Y) = tr{R{-^X)Y) = 

(i) follows directly from Lemma 4.4.14 (i), (iv) while assertion (ii) is a 
consequence of Lemma 4.4.14 (iii) and (v). Formula (iii) is implied by 
Lemma 4.4.14 (v) and (viii) and assertion (iv) is just the metric trace of 
(i) and (iii). I 



4.5 Isometries and Killing vector fields 

An isometry is a dijfeomorphism which preserves the metric, pseudo- 
Riemannian manifolds with many isometries are especially simple. 
The relevance to the theory of space and time comes from the fact 
that observations indicate that our universe is well approximated by 
Lorentzian manifolds with many isometries (cf. Chap. 6). 
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Definition 4.5,1. Let (M,g) and {M,g) be pseudo-Riemannian mani- 
folds. An isometry is a diffeomorphism (p: M M which preserves the 
metric f {<(>*§) = g. A local isometry is a local diffeomorphism (p such that 
{(p*g)x ~ 9x od all points x 6 M. 

Lemma 4.5.1. Let (M^g) and {M,g) be pseudo-Riemannian manifolds 
andU C M be a connected open set. If (p,'ip\U M are local isometries, 

then (p — 'ip if and only if there is a point x £U with Tx<p = Tx'ip, 



Proof. The two isometries clearly coincide on the closed set V = {y G 
U : Ty(p - Ty'p}. Since V is non-empty and U is connected, we only 
need to show that V is open. Let y G V and W be a normal neighbour- 
hood of y. Then for every z gW there is a vector v[z] eTyM with z — 
expy{v[z]). But this implies (p{z) ^ <p{expy{v[z])) = exp^^y){TyCp{y[z])) = 
exp.,p^y){Ty'ip{y[z])) = 'ip{expy{v[z])) = 'tp{z). Hence ^|w = and 

therefore W C V. ® 



Definition 4.5.2. A Killing vector field is a vector field ^ whose flow 
defines local isometries. 

Lorentzian manifold (M, g) is stationary in a region U C M if there 
is a timelike Killing vector field in U. It is static in U if this Killing 
vector field is orthogonal to spacelike hypersurfaces. 

Clearly, only very special pseudo-Riemannian manifolds can have non- 
zero Killing vector fields. A simple example is given by a metric which 
does not depend on one of the coordinates. Then the corresponding 
Gaufiian vector field is a Killing vector field. 

Lemma 4.5.2, A vector field ^ is Killing if and only if £^g = 0 if and 
only ifV^^ is antisymmetric. In this case we have = •|d^^ 
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Proof. The first equivalence is clear since the Lie derivative is the 
derivative along the integral curves of In order to prove the second 
equivalence we calculate 



(£^gKU, V) = £^ {U, V) - v) - (u, £^v) 

* * 

★ ^ ^ ^ ^ 

= (v^u, v) + V) - (u, 

+ {u,Vv^) 

= V^\U,V) + V^\V,U). 

Here we have used that for a Levi-Civita connection the terms marked 
with a * add to zero. It follows that ^ is a Killing vector field if and only if 
is anti-symmetric. Now the assertion follows from (d^'’)a6 = 2 V[o^w. 

[p- 209 i]| |‘ 
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Proposition 4 . 5 . 1 . Let ^1,^2 be Killing vector fields. Then [^1,^2] is 
also a Killing vector field, i.e., the Killing vector field on a pseudo- 
Riemannian manifold form a Lie algebra. 



Proof. We have only to show that the commutator of two Killing vec- 
tor fields is a Killing vector field. From Proposition 2.4.3 we know that 
£ [^, ?7]^ - for any tensor ip. In particular we obtain Trt m = 

= 0-0 = 0 . ’ | 



Lemma 4 . 5 . 3 . Let ^ be a Killing vector field and 7 &e a geodesic. Then 
is a Jacobi field and s 1— > (^7(s),7.y(4)) is constant. 

Proof. Denote the flow of ^ by Ft. Since Ft is an isometry for each t 
and s i-i 7(5) is a geodesic, the curve s ^4(7(5)) is also a geodesic. 
Hence (s,t) Ft('j{s)) is a variation of geodesics and its deviation 

vector field, ^Ft'j{s) = ^7(3)) is a Jacobi field. For the second property 
note that V^'’ is anti-symmetric by Lemma 4.5.2. Hence V..y(^,7) = 

(V;ye, 7 ) + (^, = V^^ 7 , 7 ) = 0 . I 



In the rest of this section we will investigate highly symmetric pseudo- 
Riemannian manifolds. These results are of independent mathemati- 
cal interest and will be used in Chaps. 7, 6. 

Definition 4 . 5 . 3 . A pseudo- Riemannian manifold (M,g) is called lo- 
cally symmetric if VR — O. 

This definition implies that the components of R with respect to a par- 
allelly propagated frame are constant functions. 
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Lemma 4.5.4. Let (M,g) be a pseudo-Riemannian manifold. It is lo- 
cally symmetric if and only if for every curve 7 and all vector fields 
f7, V, W wich are parallelly propagated along 7 the vector field R{U^ V)W 
is also paralley propagated along 7. 

Proof. The equation Vi? = 0 implies for parallelly transported vector 
fields U,V,W 



W^{R{U, V)W) = (U, V)W + R{V^U, V)W 

=0 =0 
+ R{U,'^)W + R{U, V)V^ 
==0 



along 7. Hence R{U, V)W is also parallel along 7. 

Conversely, let u,v^w e T^M and 7 be a curve with 7(0) = Let 
U^V^W be the parallel propagation of along 7 and assume that 

the vector field R{U,V)W along 7 is also parallel. Then the assertion 
follows from 



=0 



=0 



{u,v)w = V^^Q-^iR(U,V)W) 

=0 =0 

- -R(w, - R{u, v) V^(o)^ 

= 0 . 



w 



Theorem 4.5.1. Let (M,g) and (M^g) be locally symmetric manifolds 
and X e Mj X e If there exists a linear isometry A: TxM T^M 
with AR(uyv)w = R{Au^ Av)Aw for all u^v^w 6 TxM, then there are 
neighbourhoods U,U ofx, x and a unique isometry <f\ti ^lA with Tx<j> = 
A. 

Proof. We only need to prove existence since uniqueness follows from 
Lemma 4.5.1. We will show that for some normal neighbourhood U of x 
the map <f\U M, y 1—^ exp^Oi4 o exp^”^ is a local isometry. First 
note that (f is well defined if U is sufficiently small. By Proposition 
2.6.5 there is for every y e U unique Wx ^ TxM with exp(u;a;) == 
y. Further, for every Uy 6 TyM there is a unique € T^,^{TxM) 
with exp^(w^y^) = Uy. Since e C T^.^TM there is a 

vector Ux such that ~ follows therefore from 

Proposition 2.9.5 that {uy,Uy) = J(l)), where J is the unique 
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Jacobi vector field along the curve 7: ^ /(0,t) with J(0) = 0 and 

V^J(O) = Ux- Rom the definition of cj) we get 

(p^{uy) = T exp^TATexp^~^{uy) - Texp^TA{u^^) = Texp^{AuAwJ 



and by the same argument as before it follows that there is a Jacobi 
field J along the geodesic 7: t i-> exp(^i4^^;a;) which satisfies J(0) = 0, 
V;^J(0) ^ Aux, g{(j)^{uy),^^{uy)) = J(l)). 

Let {E \^ . . . j En} be a parallelly propagated frame along 7 with E\ — 
7 and let {£^1, . . . , En} be the unique orthonormal, parallelly propagated 
frame along 7 with ^^(0) = AEi {% e {1 , . . . ,n}). With respect to these 
frames the Jacobi equations for J and J are given by 



dV^ 



k=l 



and 



d2j* 

d«2 



n 






Here we have used that 7 is the unique geodesic with 7(0) = ^7(0) which 
implies that E\ — 7. The functions R^\ki and R\ki are each constant 
by Lemma 4.5.4. Since we assume AR{u,v)w ~ R{Au,Av)Avj for all 
u^VjW G TxM^ the definition of our parallel frames implies R^iki “ ^\kl 
for all i,fe. Further, the functions satisfy J^(0) = J^(0) = 0 and 

(by the definition of our frames) ^ J^(0) = ^ J^(0). Hence the funda- 
mental theorem for differential equations 2.4.1 implies J^{t) ™ J^{t) for 
all k and we get 

n 

g{<l>.Uy,<l>.Uy)=^g{J(l),Jil))= E 

i^k~l 

= E JHi)j\i)~9mo).mo)) 

i^k=l 

n 

= E ni)jHmAEi{o),AEkm 

iyk~l 

n 

= E JH^)JHmEi{ 0 ),Ek{ 0 )) = g{J{l),J{l) 

i^k—l 

= g{Uy,Uy). 



and the assertion follows from the polarisation identity 



= 2^9{u + v,u + v) 



9{u,u) - 9{v,v)). 



I 
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Proposition 4.5.2. Let{M,g) and{M,g) be pseudo- Riemannian man- 
ifolds with constant curvature c and c. They are locally isometric if and 
only if they have the same dimension and signature and satisfy c = c. 

Proof. Observe first that the conditions are necessary. 

We show now that a pseudo-Riemannian manifold with constant cur- 
vature is necessarily locally symmetric. Let f , u^v^w e TxM and C/, V, W 
vector fields which satisfy Ux = u^Vx = u , Wx = w and whose covariant 
derivatives vanish at x. Prom Proposition 4.3.3 we get 

(V^i?) {u, v)w = V)W) - R{V^U, v)w 

- R{u, V^V)w ~ R{u, v)V^W 
- cV^ {{V, W)U~ {[/, W) V) = 0, 

If (M^g) and (M^g) have the same dimension and signature then for 
any x G TxM^ x G T^M there exists a linear isometry A: TxM TxM. 
If (M, g) and (M, have the same constant curvature then this isom- 
etry satisfies AR{u., v)w = R{Au^ Av)Aw for all u^v^w G TxM and the 
assertion follows from Theorem 4.5.1. I 



Corollary 4.5.1. Let (M^g) be a pseudo- Riemannian manifold with 
non-zero constant curvature. Then there is a hyperquadric Quad^“^(c) 
(c^O) which is locally isometric to {M^g). 

A global classification of pseudo- Riemannian manifolds with constant 
curvature is much more difficult (Wolf 1977). The following Lemma in- 
dicates that Hyper quadrics have a very large isometry group. 

Lemma 4.5.5. Let c 7 ^ 0. and x^y G Quad^“^(c). For any pair of 
orthonormal bases {ei, . . . , Cn] C TcQuad^“^(c) and {/i, . . . , /^} c 
TyQuad^“^(c) there is an isometry Quad^“^(c) — ^ Quad^“^(c) with 
= fi (iG n}). 

Proof. Let he the linear map which maps a;, ei, . . . , to 

2 /? /i> • • ♦ )/n- It is an isometry of onto itself and therefore also 

maps QuadJ5'~^(c) onto itself. Since ~ = /i, the restriction 

<;6 of ^ to QuadJJ“^(c) is the desired isometry I 



4.6 Length and energy functionals 

In this section we will study the problem of extremising the length 
functional and find necessary and sufficient conditions in the Rie- 
mannian and the Lorentzian case. Here we will lay the foundation 
for many surprisingly strong global theorems in dijferential geometry 
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(e.g. the Theorem of Myers, cf, (O'Neill 1983, theorem 10.24)) ctnd 
for the singularity theorems in general relativity (cf Chap. 9). 

This section is mathematically more involved than the other sec- 
tions in this chapter and can be skipped on first reading. 

This section uses material from Sect. 2. 9. 

In Riemannian geometry the length of a curve measures how much wire 
one would need to model the curve in space. It is a fundamental geo- 
metrical experience in Euclidean geometry that for any given (not too 
distant) pair of points there is a curve of shortest length which connects 
them. 

In Lorentzian geometry the length of a causal curve can be inter- 
preted as the proper time an observer needs in order to traverse this 
world line. Since in special relativity moving clocks are slower (twin para- 
doxon) one expects that for any two (not too distant), causally related 
point there is a longest causal curve which connects them. 

For other signatures the problem of extremising length does not lead 
to non-trivial results (cf. Lemma 4.6.9) 

For the discussion in this section it is technically advantageous to 
widen the class of admissible curves to the continuous, piecewise smooth 
curves. The advantage lies in the fact that in many situations it is much 
easier to construct a continuous, piecewise smooth curve with certain 
properties than a smooth curve. 

Definition 4.6.1. Let {M,g) be a pseudo-Riemannian manifold and 
7: [a,b] ^ M be a continuous, piecewise smooth curve in M. 

Then the length of ^ is defined by L{'y) := ^/\9('y(t)^j{t))\ dt. 

This definition makes sense since a piecewise smooth curve has a well 
defined derivative everywhere but on a set of measure zero. It is inde- 
pendent of the chosen parameterisation. In the case of Euclidean space 
it coincides with the length one would define through the approxima- 
tion of 7 by polygons. The following lemma guarantees that there are no 
repercussions in considering piecewise smooth curves instead of smooth 
curves. 

Lemma 4.6.1. Let [a, c] M be a piecewise smooth curve. Then 
there is a sequence of smoothly immersed curves 7^ : [a, c] M which 
converge pointwise to 7 and satisfy Ti{t) jit), lim^_,.oo ^(7i) = ^^(7)‘ 

Proof Assume that 7 is the concatenation of two smooth curves p : [a, 6] 
M and A: [b,c] ^ M where (i{b) = A(6). We choose a coordinate system 
(xS . . . , .x’^) such that A is given by t (t, 0, . . . , 0). Let io e N such 
that b — > a and let U = b — 2“^ where i> io and i € N. By Lemma 

2.1.7 there are smooth functions [a,c] [0, 1] such that 
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'= 1 


for all t € [a.^ti^i 


^i{t) < > i 


for all t < 


= 0 


for all t e [6, c], 


'=0 


for all t G [a.ti^i 


'Ipiit) < > \ 


for all t > ti], 


= 1 


for all t G [6, c], 



We define the curve fii with respect to our coordinate system by 

+ / {‘Pi{s) (/i^(s) + cfv>i(s)) + S'li)i{s)) ds, 

where the constants are determined by the condition A(6) = Jiib). 
Notice that /i(t) = ji{t) for t < and 7i(t) — X{t) for t > b. Since fx is 
smooth there is a number c > 0 such that \fi^{t)\ < c and \ix^{t)~X{b)\ = 
\ix^{t) — ix{b)\ < c\\b — t\ for all t G [a, 6]. Hence using > 1/4 for 

t e we obtain 




< (2c + 1)(6 — U^i), 

From ti — b — we get |c^| < 2(2c 4- 1). Since c-^, are 

uniformly bounded with respect to i, the curves 7^ converge pointwise 
to 7 and the lengths of 7^ converge to the length of 7. I 

Corollary 4.6*1* Assume that (M^g) is a Lorentzian metric and that 
7 : [a,c] M is piecewise smooth future directed causal curve. Then 
there is sequence of smoothly irhmersed timelike curves 7^ : [a, c] — > M 
which satisfies ji{t) — ^ 7(^), ^i{t) — > 'y{t), and L('y). 

Proof. We can assume without loss of generality that 7 is the concate- 
nation of two smooth curves mu: [a, 6] ^ M and A: [6, c] — > M with 
fx{b) = X{b) such that both fi{b) and A (6) point into the same future 
cone. There are sequences of timelike curves (Xi and A^ which converge 
to fx and A and satisfy jXi{b) = Xi{b) — p{b). For each such pair of curves 
Lemma 4.6.1 provides a sequence 7^,^ of curves such that 7^^- converges 
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to the concatenation of jXi and \i. Since both, and are timelike and 
future directed so is for j enough. We can assume without loss of 
generality that all 7^^- are timelike. It follows that the sequence {'Ji,i}i£n 
consists of timelike curves and converges to 7. I 



4.6.1 Variation of length and energy 

In Euclidean space, the shortest curve between two points is the straight 
line connecting them. In Minkowski space, the longest causal curve be- 
tween two points x,y e is also the straight line connecting them. 




Fig. 4.6.1. A broken lightlike geodesic can be 
smoothed out by a curve of arbitrarily small length 



In a general Riemannian manifold “without holes” it is intuitively 
clear that any two points can be joined by at least one shortest curve. 

In a Lorentzian manifold, the infimum over the length of all curves 
which connect x and y is always zero since we can join any two points 
by a broken lightlike geodesic which then can be smoothed out to give 
a smooth curve of arbitrarily small length (cf. Fig. 4.6.1 and Corollary 
4.6.1). It is also clear that there does not exist a curve of maximal length 
connecting x and y since we can always choose a spiralling spacelike curve 
of arbitrarily large length (cf. Fig. 4.6.1). However, we will see below that 
in many situations there exist curves connecting causally related curves 
X and y which maximise L in the class of all causal curves.^ 

pseudo-Riemannian manifolds which are neither Riemannian nor Lo- 
rentzian do not admit any non-trivial solutions to the length extremising 
problem, even if one restricts to spacelike or timelike curves. These ar- 
guments will be made precise in Lemma 4.6.9 below. 

^ Our examples also imply that the length extremising problem does not have 
a solution if one restricts to spaceiike curves instead of causal curves. 
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Fig. 4 . 6 . 2 . A curve minimising the 
distance between two spacelike sub- 
manifolds El and E2 



We will actually investigate the slightly more general problem where 
the endpoints y are replaced by submanifolds without boundary Ei^ 
E2 (cf. Fig. 4.6.2). In order to solve the length extremising problem in 
the Riemannian and in the Lorentzian case we will study 1-parameter 
families of curves /: [a, 6] x (-e, e) M, (s,t) /(s,t) such that 

f{s,a) =6 El and f{s^b) e E2 for all $. If 7 extremises the length 
functional L for all smooth curves^ which connect Ei with E2^ then we 
have ')) 1-parameter families with j{t) — /(O, t). 

Through the investigation of ♦)) we will arrive at sufScient 

conditions. 

Definition 4.6.2. Let Ei^ E2 be submanifolds of M and 7: [a, 6] ^ M 
be a curve which connects Ei with E2- 

A continuous variation /: (— e,c) x [a, 6] M, (s,t) /(s,t) 0/7 is 

called piecewise smooth if there are numbers ti^. , . ^tk € («, J>) such that 
/|(-e,e)x[£i,<:i-}-i] smooth, whcrc to := a,t/e 4 .i := b and i G {0, . . . , fc}. 

A (continuous f piecewise smooth) variation f of j connects Ex with 
^2 ¥ f{^y<^) ^ A ^ ^2 for all s € (~e,e). 

We denote the vector field T(^s,t)f{ 9 s) along f by fs^ the vector field 
T(^s,t)f{^t) along f by ft (where defined), and call the (piecewise smooth) 
vector field ^{t) := {fs))\s=o olong 7 the variation vector field. 

Lemma 4.6.2. Letj: [a,b] M be a smooth curve which connects two 
submanifolds Ei, E2> For any vector field ^ along 7 with ^{a) e Ty(a) A? 
^(6) € there exists a variation / 0/ 7 which connects Ei with 

E2 and which has variation vector field 

Proof. Let C Ex and p.2 C E2 be smooth curves with /ii(0) = 7(a), 
^ 2 ( 0 ) — 7(6), Ai(0) = ^(a), and fi 2 { 0 ) = ^(6). We can now extend ^ to 
a vector field E such that /i^ {i G {lj2}) are integral curves of S’. If F 
denotes the flow of S we set f{$,t) = Fs{'y{t)). I 



® In the Lorentzian case: all smooth, causal curves. 




218 4 , Pseudo-Riemannian manifolds 



If 7: [a,6] -» M is piecewise smooth then 7 is discontinuous at those 
points where 7 fails to be smooth. We will therefore need the following 
technical definition. 



Definition 4 . 6 . 3 . Let j be a continuous, piecewise smooth curve and V 
be a piecewise smooth vector field along 7. For each to € [a, b] we set 

AV{to):=^ lim V(t)- lim V(t). 

It is clear that AV (to) ~ 0 if and only if V is continuous at to- 

Lemma 4 . 6.3 (First variation of arc length). Let 7: [a,b] M be 
a spacelike or timelike, continuous, piecewise smooth curve, let rj = 
sign((7,7)), and f: [-e,e] x [a,b] M,(s,t) f(s,t) be a continu- 

ous, piecewise smooth variation of 7 with variation vector field De- 
note by ti, , . , ,tk e (a, b) the points where 7 fails to be smooth. Then the 
derivative of L with respect to s is given by 




Proof If e is small enoug h, all curves /(s, •) are either timelike or space- 
like, and the integrand \/\ {{ftJt)) | is differentiable for all (s,t) where 
/ is differentiable. We can therefore exchange on every smooth piece of 
7 the differentiation and the integration. Now the assertion follows from 






I t 

dJs 



\VWi 






ft 

'Jn {ft, ft ) , 



,fs 



Corollary 4 . 6 . 2 . Let (M,g) be a Riemannian or Lorentzian manifold. 
Si, S2 be two submanifolds without boundary, andj be a curve connect- 
ing Si with S2, 

(i) If (M,g) is Riemannian and 7 is shorter than any other (neigh- 
bouring) curve connecting Si and S2 then ^ is a pregeodesic which 
intersects both Si, and S2 orthogonally. 




4.6 Length and energy functionals 219 



(a) Assume that Ei, S2 are Riemannian submanifolds or degenerate 
to a point If (M, g) is Lorentzian and 7 a causal curve which is 
longer than any other causal curve connecting Ei and E2 then 7 is 
a causal pregeodesic which intersects both Ei, and E2 orthogonally. 

Proof, We prove only (i) since the other case is completely analogous. 
We assume first that 7 is not a pregeodesic. Observe that it is a 

pregeodesic if and only ifV(t) ( ^1(77)1 ) ^ Hence by 

our assumption there would be a point to ^ {(^^b) with V(to) •=/=■ 0 . Let 
be a vector field along 7 such that {V{to),^i{to)) < 0 . BY continuity 
there is a neighbourhood of to such that (V'(t), ^i(t)) < 0 for 

all t e (t_,t-}-). Let now </? be a smooth, positive function with support 
in ^(to) ^ 0. (Such a function exists by Lemma 2.1.7). 

Then {V{t),(p{t)^i{t)) > 0 for all t e [a, 6] and does not vanish in a 
neighbourhood of to* Taking a variation with variation vector field ^ = 
ip^i we obtain therefore (^L(/(s, *)))|^^q < 0 . This implies that there 
are shorter curves than 7 in contradiction to our assumption. 

We assume now that 7 is a pregeodesic but does not intersect Ei 
orthogonally. Then there is a vector v e with (7(a), u) < 0 . 

Let ^ be a variation vector field with ^(a) = v and ^(b) ^ 0. Since 
^(a) is tangential to Ei there is a variation / of 7 with variation vector 
field ^ such that /(s, •) connects Ei with E2 for all s. Again we obtain 
(^L(/(5, •))) < 0 in contradiction to our assumption that 7 is length 

minimising. It follows that 7 intersects E± orthogonally. 

Finally observe that the same argument holds equally well for E2. I 

The discussion above does not apply to null curves since L involves the 
square root of (7,7). This problem can be avoided if one considers the 
energy of the curve 7, £‘(7) := | (7(^)7(0) 

Unlike the length functional this integral expression does depend on 
the parameterisation of the curve. While it is not true that spacelike or 
timelike curves which extremise L also extremise the energy integral, we 
will see below that this property almost holds. 

Lemma 4 , 6*4 (First variation of energy). Let 7: [a,6] -4 M 6e a 
continuous, piecewise smooth curve and /: [-e,e] x [a, 6] -4 M,{s,t) i-> 
f(s,t) be a continuous, piecewise smooth variation with variation vector 
field Denote byt\,,,.,tk ^ b) the points where 7 fails to be smooth. 
Then the derivative of E with respect to s is given by 



^ The name “energy” comes from the fact that in the Riemannian case the in- 
tegrand is just the kinetic energy of a mass point of mass 1. In the Lorentzian 
case the integrand has nothing to do with energy. 
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|s— 0 a ^ 

Proof. Consider a piece of 7 where it is smooth. The assertion follows 
from 

“ {ft, ft) = (^fuVdJt'j = (^fuVdJe'j = ifufsY - (W djtjfj • 



It follows that a curve which extremises energy is a geodesic (and not 
merely a pregeodesic). 

We will now derive sufficient conditions for curves to ext remise the 
length between submanifolds without boundary. But first we need a tech- 
nical lemma. 

Lemma 4 . 6 . 5 . Let j be a spacelike or timelike pregeodesic and denote 
the orthogonal projection to the orthogonal complement of 'y by 
Then for every vector field V along 7 the formula 
holds. 

Proof. The vector field V can be decomposed into its part orthogonal to 
7, W, and its part tangent to 7, where is a smooth function. Prom 
y z= (p'j -\-W and (7, W) ~ 0 we get 

= dy^(7)(7)^ + ^ = (^7^)^ ' 

where we have used that a curve is a pregeodesic if and only if V^7 || 7. 
The assertion follows now since 

( (v^w) , 7) = {W, iY -(w, V^-y) = 0-0 = 0 

implies ^ = I 

In the following we will freely interchange T and when 7 is a geodesic. 

Lemma 4 . 6.6 (second variation of arc length). Let 'j be a space- 
like or timelike geodesic with r/ = (7,7) € {-1,1} and let /: [— e,€] x 
[a, 6] — > M, (s,f) t— ^ /(s,t) be a continuous, piecewise smooth variation 
1- If denote by t\, , . , ,tk G (a, b) the points where f{s, •) fails to be 
smooth then the second derivative of L o f with respect to s is given by 

(^^(/(®--)))^^_o =»? £ 7)^,7)) dt 
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!s=0 



+ v{^(VdJs 

= -V f + i?(e-", 7 ) 7 ,e-') dt 

k 



i=l 



+n va/. 



|s=0 



where ^ denotes the variation vector field and the orthogonal pro- 
jection to 

Proof. Using the formula for the first derivative in the proof of Lemma 
4.6.3 we obtain 



d^ 

ds2 



\/v {ft, ft) 

^(fuV dJt)(fuV dJt 



ifu ft) 



+ 



VviftJt) 



g ( (v dA V dsft) + V 5 , V djt^j ) . 



(4.6.3) 



/ / 

Prom V djt =V dtf^ 

V a, V ejt =v a, v a,/* =v a, v djs + R{fs,ft)fs 
we get /v a./*> V dJt) = (v a./«- V a./«') ^nd 



V a, V a = (ft,Vo,v gJs ) + {ft, Rih, ft)L ) . 



Since for every vector field V along 7 we have V = V-^ + Vj^^ft and 

therefore (V, V) = (P-*-, P-*-) + the second and third summand in 
Equation (4.6.3) simplify to 



Vvifuft) 



Va/d > Va/. 
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Using the product formula for the term (ft,V g V g fs) we finally 
obtain ' ‘ / 



ds^ 






+ {^ft,VgJs 

+ {ft,R{fs,ft)fs))dt 



V dJs 



and the first equality in the assertion follows since / (O, ■) = 7 is a geodesic 
with (7, 7) ~ 77. 

Since for every smooth piece of the variation / the equation 
holds, the second equality follows from an integration by parts using 



There is an analogous formula for the second variation of the energy 
integral. 

Lemma 4 . 6.7 (second variation of energy). Let 7 6e a geodesic 
and f: [-e, e] x [a,b] M,{s,t) >-> /(s,t) be a continuous, piecewise 

smooth variation of Denote by ti,...,tk 6 (a, 6) the points where 
/(s, •) fails to be smooth. Then the second derivative of E with respect 
to s is given by 

i=l 
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Proof. The first equation follows from 

1^2 (Ms,-), Ms,-)) = (v + (v 9, V 

= (v dtM V dtfs) + (v a, V ajs, ft'^ 

= (v ajs, V dj}j + (v a, V ajs, 

+ {RiMMMM 

= djs,v dj}j + (v djs,ft^ 

- (v V + {R{fs,ft)fs,ft) ■ 

The second equality follows from an integration by parts exactly as in 
the proof of Lemma 4 . 6 . 6 . I 



Lemma 4 . 6 * 8 . Let E2 be submanifolds of M and 7: [a, 6] ^ M be 
a curve from to E2 which intersects both submanifolds orthogonally. 
Assume that f is a (continuous, piecewise smooth) variation of ^ which 
connects E\ with E2- Then 

b 

= {ir^Mb),m),m - {Ii:Ma),^ia)),'y(a)) 

a 

holds, where Ijj. denotes the shape tensor of Ei. 

Proof The assertion follows immediately from the definition of the shape 
tensor and our assumption 7(a) J_ Ei^ 7(6) ± E2. I 

Lemma 4 . 6.8 implies that for a geodesic variation the second deriva- 
tive (respectively, {■&Mf(sr)))^^^J is a quadratic 

form on the space of all variation vector fields along the central geodesic 
7. The associated bilinear form is called the index form of 7 (respec- 
tively, of 7). It is an infinite dimensional analogue to the Hessian of 
a function. 

Definition 4 . 6 . 4 , Let Ei, E2 be submanifolds of M or points in M and 
7 : [a,b] ^ M be a geodesic which connects E\ and E2 and intersects both 
submanifolds orthogonally. Denote by be the space of piecewise 

smooth vector fields along 7 which are tangent to Ei at a and to E2 at 
b. 

The energy index form is the bilinear form 
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^EuS2 ■ ^ ^ 1 ,^ 2 ")' ^ 

(^1,6)^^ I ((v^6,v^^2) + (i?(ei,7)6,7>)dt 

+V,;,(^i(6),6W),7(&)) - (6(a), 6(a)), 7(a)) • 

Ifj is either spacelike or timelike and satisfies (7,7) = rj e {—1, 1}, the 
length index form is defined by 

^si,e 2 '■ '^si,E2'y ^ '^huS2'y ®i 

(6,6) ^ ( (v^6^. V^6^) + (i?( 6 , 7 ) 6 . 7 ))dt 

- »?(-ffri(6(a),6(a)),7(a)) 

Corollary 4 . 6 . 3 . Let j be a geodesic from Si to S2 which intersects 
these submanifolds orthogonally. The index form is positive semi- 

definite if 7 minimises length and negative negative semi- definite if 7 
maximises length. 

Proof For L the assertion follows from the Taylor expansion 

=0 

L{f{s , .) = L(7) + s -))^_^ +J (^^(/(s. -))^_^ + 0(sb- 

The proof for the energy integral is exactly the same, I 

The following lemma summarises in which cases there is a non-trivial 
extremising problem for E and L, In particular, it implies that the ex- 
tremising problem has only in the Riemannian and Lorentzian case non- 
trivial solutions. 

Lemma 4 . 6 . 9 . Let (M,^) be a pseudo- Riemannian manifold, Si, S2 
be submanifolds of M without boundary, and 7 : [a,b] ^ M be a geodesic 
which connects Si with S2 and intersects both submanifolds orthogo- 
nally. If ^ is a null geodesic assume in addition that j(a) ^ Ty(o)^i 
7(6) ^ Ty(6)^2* 

(i) If ^EC^2 positive (respectively, negative) semi-definite then g 

has signature (H h) (respectively, ( )). 

(ii) Let I^C^ bilinear form restricted to 7-^, 

is positive (respectively, negative) definite then either 

- g has signature (H h) (respectively, ( )) 

or 

— g has signature ( — h h) (respectively, (— • • • — \-)) 
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— 7 is causal (respectively^ 7 is spacelike or null) 

— El, E2 are spacelike (respectively, E±, E2 are timelike) at 7(a); 
7(6) 

(in) If the index form Is'i,S2 positive semi-definite then g has 
signature (4 h) or { -). 

(iv) If the index form Isi,E 2 negative semi- definite then either 

— g has signature ( — 1 h) 

“ 7 timelike 

— El, E2 are spacelike at 7(6) 
or 

~ g has signature ( +) 

~ 'y is spacelike 

— El, E2 are timelike at ^{a), 7(6) 

Proof. Let 5 € M \ {0} and v e be a vector with (v,v) — 5 . For 

every fc G N we consider the variation vector field Wk {t) = {t) sm{{t — 

a)^^), where V is the parallel translation of v along 7. This variation 
vector field vanishes at the endpoints of 7 and is therefore in ^^7. 
The equation 

Wk) = / ((v^Wfc, V^W,) + 7)W^fe, 7)) 

J ct 




+ ^ {R{V,^)V,j)sm‘^ (t - o)^))) 

implies that for sufficiently large k the integrand has the same sign as 5 , 
sign(/®;7^(W;t,W^fe))=sign(5). 

Assertion (i) follows immediately from S == {v^v). 

For the rest of the prove we will assume in addition that (7(a), t;) = 0. 
Consequently, we have {Wk{t),'y{t)) = 0 for all t G [a, 5]. 

(ii): Assume that is positive semi- definite and suppose either 

that 

— there is a 2-dimensional subspace of TxM restricted to which g is 
negative definite or that 

— there is a 1-dimensional subspace restricted to which g is negative 
definite and that 7 is spacelike. 

In both cases there is a vector v G T^(^a)M with 5 := (v,v) < 0 and 
V _L 7a. Hence sign(/^’^]^(WA;, Wk)) = -1 and cannot be positive 

semi-definite. This contradicts our assumption, and therefore it follows 
that either 



— g is Riemannian or 
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“ ^ is Lorentzian and 7 is causal. 

Since 7(a) ^ (7(6) ^ and i7i (i^^) is orthogonal to 7 

the submanifold must be spacelike at 7(a) (7(b)). The proof for negative 
semi-definite is completely analogous. 

The index form is only defined for spacelike or timelike geode- 

sics. For large enough k the relation Wk T 7 implies 

and therefore sign(/^^'y^^ (Wfe, Wfc)) = sign(r/J). 

(iii) : Assume that g is not definite. Then v can be chosen such that 

5 — -77. This implies that positive semi definite either. 

(iv) : There is nothing to prove if (M, g) is a 2-dimensional Lorentzian 
manifold. Suppose that either 

- g has not signature (- H h) (respectively ( h)) or that 

- dim(M) > 3 and 7 is spacelike (respectively, timelike). 

Then v can be chosen such that 6 ^ rj and sign{I^^^jj^{Wk,Wk)) = 

sign(r7^) = 1 for large enough k implies that is not negative semi 

definite. ’ I 

Corollary 4.6.4. Let (M, g) be a Riemannian or a Lorentzian manifold 
and 7 : [a, 6] — ^ M be a spacelike or timelike geodesic which connects Ei 
with U2 Ojnd intersects both submanifolds orthogonally, 

ft) is definite then for all variations f: [— e, e] x [a, 6] — ^ M 

with non-vanishing variation vector field there is a 6 e (0, e) such 
that 

~ L{'y) < T(/(s, ■)) Vs e [—S,S] in the Riemannian case, and 

~ L(7) > L(/(s, •)) Vs e [-^, ^] in the Lorentzian case, 

(a) ijf semi-definite, there exist variations /: [— e,e] x 
[a, 6] M of j such that 

— L{'y) > L{f{s,-)) Vs 6 [— €,e] in the Riemannian case, and 

" L(7) < L{f{s,')) Vs G [— e,e] in the Lorentzian case. 

Proof To prove (i) let / be a variation of 7 with non- vanishing variation 

vector field C- Since L(/(s, •)) ^ ^(7) -h 

assertion follows from ^ ^ Lemma 4.6.9. 

For the proof of (ii) let variation fields with 

and ^ < 0. By Lemma 4.6.2 there are variations f± of 7 with 

variation vector fields Now the assertion follows from L{f±{$, •)) = 
L(7) + |^^4;>,(e±,e±) + o(s2). I 

In Lemma 4.6.15 below we will extend this result to null geodesics. 
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4,6.2 Conjugate and focal points 

The existence of conjugate points is closely linked to semi-definiteness of 
the index form in the case that Siy E 2 are points. Before investigating 
this relationship we will need to collect a few important facts about 
Jacobi vector fields in pseudo-Riemannian manifolds. 

Lemma 4.6.10. Let 7: [a, 6] M be a geodesic and J, J be Jacobi 
fields which vanish at some point to € [a, b]. Then we have = 

Proof, It follows from 

(v^J, j)‘ = + (v^J, 

= (il(J,7)J,7) + (v^J, 

that ^ J, j'j is constant. Hence the assertion is an imme- 

diate consequence of J(to) = J(to) = 0 . I 

We show now that it is possible to split Jacobi fields into a tangential 
and into an orthogonal part unless 7 is a null geodesic. Moreover, the 
tangential part is always trivial. 

Lemma 4.6.11. Let 7: [a,6] —> M a geodesic^ ^ be a vector field 
along 7 with ^(t) || 7(t) for all t € [a, b]^ and J be a Jacobi field along 7. 

(i) The vector field ^ along j is a Jacobi field if and only if there are 
numbers with ^(t) = (at + P)'y(t), 

(a) The following statements are equivalent 

(a) (J^(t),7(t)) = 0 for all t e [a, 6], 

(b) there are two different numbers c, d G [a, b] with (J(c),7(c)) = 0 
and (J(d),7(d)) = 0, 

(c) there is a number c £ [a, 6] with (</(c),7(c)) = 0 and 

^V^(c)J,7(c)) =0. 

Proof (i): If C || 7 we can write ^(t) = ^(t)7(t). Since R{(j}{t)'y, 7)7 = 0 

the Jacobi equation reduces to V^V^(^(t)7(t)) — 0 which in turn is 

equivalent to <^(t) = 0. 

(ii) : The assertion follows once we have shown that 

ip(t) := 



satisfies <p = 0. But this follows from 
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= (v^V^J,7^ = (-i?(J,7)7,7) = 0. 



Corollary 4.6.5. Let 'y: [a, b] ^ M be a geodesic which is timelike or 
spacelike and J be a Jacobi field along 7. Then the orthogonal projections 
J~^ to 7 and to 7-*- are also Jacobi fields along 7. 

Proof. Without loss of generality assume that (7,7) = rj e {-1,1}. 
Then J^ is given by rf (J,7) 7* Since 7 is a geodesic we obtain VjJ^ ~ 

7 ] ^V^J,7^ 7 = . In the same way we get 

Prom J^ II 7 we obtain i?(J*^,7)7 = 0 and therefore 

+ i?((J^,7)7 = = ("^7^7-^y 

= (v^V^j)^ + (i?(J,7)7)T = 0 , 

where we have used jR(*,7)7)'^ = 0 (cf. Proposition 4.3.1). From - 
J - J^ and = 0 we get = Va^Va^J. The second 

assertion follows now from 7)7 = i?( J, 7)7, I 

Let 7: [a, 6] -4 M be a geodesic and J be a Jacobi field which vanishes 
at a and c € (a, b). There is a geodesic variation /: (s,i) i-> f{s^t) of 7 
with variation vector field J (cf. Proposition 2.9.1 and Corollary 2.9.1). 
These geodesics pass through 7(a) and ‘‘intersect 7(c) to first order” 
though they may not actually meet this point. Up to second order the 
geodesics segments /(5,’)[a,c] have all the same length as 7|[a,c]- It is 
therefore plausible to expect that 7 will not extremise length beyond 

7(c). 

A typical example where the geodesics meet in both points 7(a) and 
7(c) is given by those great circles of the unit sphere sphere 5^ which 
intersect both the south pole at 7(a) and the north pole at 7(c). Since 
in this example all neighbouring curves intersect 7(c) at an angle differ- 
ent from zero, they can be deformed so that they meet 7(6) and have 
length shorter than 7 (cf. Fig. 4.6.3). This is also true in the general 
case, even if the neighbouring geodesics do not actually intersect 7(c) 
(cf. Theorem 4.6.1). 
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For arbitrary submanifolds Ei, S2 an analogous statement cannot 
be formulated. There is simply no canonical way to compare different 
submanifolds along a curve. Hence we cannot speak of a “first point 
where a given normal geodesic fails to minimise length between Ui and 
272”. However, we can ask at which point a geodesic 7 orthogonal to 
a submanifold S fails to minimise distance from U. In order to answer 
this question we will need to generalise the concept of a pair of conjugate 
points to a pair which consists of a submanifold and a point. 

Definition 4 . 6 . 5 . Let U C M be a submanifold and 7: [a, 6] ^ M be 
a geodesic with 7(a) € U, 7(a) e \T^(Q,)i 7 . The point 7(c) is 

called a focal point of E along 7 if there is a Jacobi field J along 7 with 

(i) J{a) e J{p) = 0 ; 

(ii) (v^J{a),v^ + {][{J{a),v),j{a)) = 0 for all v e 

Observe that in the case that E is just a single point, condition (ii) is 
empty and the definition reduces to the definition of a pair of conjugate 
points. The following lemma explains why we also demand condition (ii). 

Lemma 4 . 6 . 12 . Let E be a submanifold of M and 7: [a, 6] M be a 
geodesic with 7(a) G E, 7(a) G \ T^{a)E, 

If f: (— €, e) X [a, 6] — > M is a variation of 7 through geodesics or- 
thogonal to E then the variation vector field ^ satisfies 

(i) ^ is a Jacobi field f 
^ 'E^{a)E 

(Hi) ^V^(a),?;) + {I{^(a),v),j(a)) = 0 for all v G T^(a)^. 

Conversely, let ^ be a vector field along j which satisfies (i)-(iii). Then 
there is a variation f: (— e, e) x [a, 6] M of j through geodesics or- 
thogonal to E which has variation vector field 

Proof. Assume that / is a variation through normal geodesics. 

That ^ is a Jacobi field follows from Proposition 2 . 9.1 and property (ii) 
follows from f{s, a) G E for all s. The equation 

implies that for every vector field V tangent to E 

=0 

(v^e, v) = (v dJuV^j =V {ftJd/) 

= -{ft,Hfs,v)). 



holds. This proves (iii). 
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Let fx: (~e,e) ^ be a curve with /i(0) = ^(a) and V be 
a vector field along /x with V(s) ± eind y(0) = 7(0). We de- 

fine /(s,t) = exp(ty(s)) thereby obtaining a variation of 7 through 
geodesics normal to E. This variation has variation vector field ^ if and 
only if fa(0, a) - ^(a) and ~ clearly have /^(O, a) = 

A(0) = C(a) for any choice of V. Since = V jjt = fi{Q)V we 

have to choose V such that == '^7(0)^- always 

possible, let V (s) = P/i|[o_^]7(«)+sP;i|[o^^] (^7(a)^) (^f- Lemma 4.4.5). 

Then we have V^(0) = 7(a) and, using Lemmas 4.4.4 and 4.4.5, 

=0 

r ^ 

= (V;i(0)Pi|[o,,]7(«))^ - •),7(a))“ + (V7(a)$)^ 

~ ^7(a)^’ 

where the last inequality follows from 

- (iu.,w).t,) = -(i gWjt'j = (wv ajt) 

for all vector fields W which are tangent to L*. I 

If 7 is not a null geodesic then the neighbouring geodesics provided by 
Lemma 4.6.12 are of the same causal type as 7. However, if 7 is null 
then this property is not guaranteed. The following lemma clarifies the 
situation for null geodesics. 

Lemma 4.6.13. Let E be a submanifold of M and 7: [a, 6] — > M 6e a 
null geodesic with j{ci) € E, 7(a) G (Ty^a)^)'^ \ ^7(0)^* 

Let ^ be a vector field along 7 which satisfies properties (i)-(iii) in 
Lemma Then there is a variation f: (— e,e) x [a, 6] M of j 

through null geodesics orthogonal to E and with variation vector field ^ 
if and only if {({t),'y{t)) = 0 for all t G [a, ?>]. 

Proof. Assume that / is such a variation through null geodesics. The 
equation {ft, ft) = 0 implies 0 = ^Vg^/f,/t^ = and 

therefore ~ Lemma 4.6.12 (ii) and Lemma 4.6.11 (ii) 

imply = 0 for all t G [a, 6]. 
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Conversely, assume that ^ is a vector field along 7 which satisfies (i)- 
(iii) of Lemma 4.6.12 and is orthogonal to 7 . We construct the variation 
/ as in the proof of Lemma 4.6.12 but now we will choose V so that it is 
null at every point of /j.. To do so, let s W (s) e T^(^a)M be a curve with 

VF(0) = 7 (a) and (W(s), W(s)) = 0 for all s. As in the proof of Lemma 
4.6.12 let ((/; s H- > |u(s) 6 17 be a curve with /it(0) = 7 (a) and /1(0) = 
^(a). We set V{s) = Clearly, y(s) is normal to 17 at all s 

and v^(y(s),y(s)) = 2(^v^y(s),y(s)J) = 2^(v^y(s))'^,y(s)^ = 

0 implies {V{s)^V{s)) ~ 0 for all s. Hence f{s,t) = exp{tV{s)) is a 
variation of 7 through null geodesics normal to E, We have to choose W 
such that the variation vector field of / coincides with Since we have 
/s(0j <^) = A(0) “ C(O') we only have to arrange W such that 

V a/*(0,a) = 

Prom - (J(/s, = (^ 7 (a)^) Proof of Lemma 4.6.12) we 

obtain 

(va,/,) =(va./.) =(v^v'w)„, 

V /(0,a) V /(0,a) ^ ^ 

- (v« 0 ) + (^7(a)«)" 

We have ^V.yCi 7 («)^ = 0 since ^ is orthogonal to 7 (Lemma 4.6.11). 

Since 7 (a) 1 T.y(a)17 we get therefore ^ , 7 (a) ^ = 0. The tangent 

space of the null cone C T^(a)M at the point 7 (a) is just {X e 

27y(o) : {j{o-),X) = 0}. Hence (v..yC) is a tangent vector to C^(a) at 
7 (a) and we can choose W such that 

implies V a) — and we are done. I 
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Fig, 4.6.3, Conjugate points on the 
sphere 



We need another technical lemma analogous to Lemma 4.6.10 

Lemma 4.6.14, Let E be a pseudo- Riemannian submanifold of M and 
[a^b] M be a geodesic which intersects E orthogonally at 7 (a). If 

JiyJ 2 Cive Jacobi fields which satisfy Ji{a) e ^'^d (VjJi{a)^v^ + 

(F( Ji(a), u), 7 (a)) = 0 for all v C then Ji, J 2 satisfy 

J2W) 

for all t G [a, 6 ] . 

Proof, Prom the proof of Lemma 4.6.10 we know 

((v^Ji, J2) - ( Ji, V^J2)) = 0. 

Hence the assertion follows from 

= - (J(Ji(a), J2(a)),7(a)) + {I{J2{a),Ji{a)),j(a)) = 0. 



We can now present a theorem which links focal points to length extrem- 
ising geodesics. 

Theorem 4.6.1. Let (M^g) be a Riemannian or a Lorentzian manifold, 
E be a Riemannian submanifold, andj: [a, 6 ] — > M be a geodesic which 
intersects E orthogonally at 7 (a). If{M,g) is Lorentzian we also assume 
that 7 is timelike 
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(i) The submanifold U does not have focal points along 7 if and only 

if 



~ the index form is positive semi- definite in the Rieman- 

nian and negative semi- definite in the Lorentzian case, 

~ exists a function vq with V ~ 

voj, i,e., V corresponds to a reparameterisation of'j. 

(ii) The point 7(6) is the only focal point of U along 7 if and only if 
— the index form is positive semi-definite in the Riemannian and 
negative semi- definite in the Lorentzian case, 

“ there is a non-vanishing vector field V: [a,b] ^ M along 7 which 
satisfies V) = 0 and has values in 7’^. 

(Hi) There is a focal point 7(c) of E along 7 with c < b if and only 
if the index form is not semi-definite. 



Proof. Observe that the cases (i)-(iii) are mutually exclusive and com- 
prise all possibilities. Hence it is sufficient to prove the “if’’-parts only. 

(i): Since there are no focal points of E along 7 there are n — 1 

linearly independent Jacobi fields Ji : [a, b] — ^ TM which 

(a) are everywhere perpendicular to 7, 

(b) satisfy Ji(a) € T^{a)^ a,nd (VjJi{a),v'^ -f (l(Ji(a),u),7(a)) = 0 
for all V € 

(c) form a basis of 7(;^)“^ for every parameter value t G (a, 6]. 

Let y be a variation vector field which is tangent to at a and 
vanishes at b. Because of (c) there are smooth functions : (a, 6] — > M 
{i = 0 , ... ,n — 1 ) with V — -h v'^Ji, We will now calculate 

= + (v^Ji + vWjJi, VkJk) 

~ + (v"Ji,VkJk) 

Lemma 4.6.14 ^ ^ 

+ (i) i^kJk} 

= (w^V^,v’‘V^Jk) + (v^Ji,V^ (vkVA^Jk)) 

= + {V^,R{V^, 7 ) 7 > 

+ (^V Ji^ ) 
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where in the last equation we have used that Jk are Jacobi fields. Hence 
we get 

= {v^Ju^Ji) + (v^.v^V^Jk )' . 

Assume for the moment that the functions have continuous extensions 
to a. Then (b), V{b) ~ 0 , y-^(a) ~V{a) G and the fact that all 

v'^ have bounded extensions to a imply 

pb 

= (J(y(a),t;‘=(a)4),7(a)). 

Writing rj = (7, 7) we obtain 

4;U)}(^V) = ((v^y^ v^y^) + (i^(y^7)^^7) di 

-77(J(y(a),y(a)),7(a)) 

pb 

= V (i'Vi,vVi>dt + 7/ (l(y(a),v^ (a) Jfe), 7 (a)) 

J a 

-r/(J(y(a),y(a)), 7 (a)) 




Since 7 is timelike in the Lorentzian case the vector field V is always 
spacelike and we can conclude that is semi-definite. Fur- 
thermore, = 0 if and only if all are constant, which 

in turn implies that V-^ vanishes since it vanishes at b. 

We still have to show that the functions have continuous extensions 
to a. There are dim(Z’) vectors • • • , which form a 

basis of This follows from Lemma 4 . 6.12 since it is possible to 

construct normal geodesic variations of 7 in any direction tangential 
to S, We can also assume that Jk{o) = 0 for all k ^ {h, • • • ,Mim(i:)} 
because otherwise we could subtract a suitable linear combination of the 
Ji, from Jk- For each t we decompose V{t) into a part U tangential to 
span{Jij(t), . . . , (^)} and a part U tangential to span{ JA:(t) : k ^ 

{^ 1 ) • • ' 5 ^dim(r)}}* is clear that all have smooth extensions to a since 
{Jij(t), . . . , are linearly independent for all t G [a, 6]. Since 

U = v~v^^ Jij and Jfc (A; ^ {ii, . . . ,idim(r)}) smooth and vanish at a 
there are vector fields W, Kk such that U — {t~a)W and Jk = (t—a)Kk- 
These vector fields satisfy W{a) = V^[/(a) and Kk{o) = V^JA;(a). From 

fjL „ v^Jk we get = v^Kk^ and from the linear independence of 
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{t G (a, 6]) and the fact that Jk(a) = 0 we infer that the vector 
fields Kk are linearly independent near a. Hence there are smooth one 
forms w* along j with u)^(Kk) = near a. Consequently, the function 
v'^ = (near a) has a smooth extension to a. 

(ii): Let {6j} > 6 be a strictly monotonically increasing sequence 

and y be a vector field along 7 which is tangent to at a and vanishes 
at b. Since V vanishes at b, there exists a well defined vector field W 
such that y(f) — {b — Let y be the piecewise differentiable 

vector field given by y(t) = {bi - t)W(t) for t G [a,6i) and y(t) = 0 
for t G [bi,b\. This gives a sequence of piecewise smooth vector fields 
{Vi} with y[a_6.] = V\^a,bi] ^[L.6] = 0. Since b is the first conjugate 
point part (i) of the theorem implies that ^ 0- Hence 

(i oo) impfies that is 

positive semi-definite. To see that there exist non-trivial variations with 
vanishing index form let J be a Jacobi field orthogonal to 7 according 
to Definition 4.6.5. Since J = J-^ we have 

pb 

-??(J(J(a),J(a)),7(a)) 




-r,{I{J{a),J{a)),^{a)) 

= r?(J(J(a), J(a)),7(a)) - rj{I{J{a),J{a)l^{a)) = 0 



(iii); Let c G (a, 6) be the first focal point of E along 7 and t 
J be a non-vanishing Jacobi field according to Definition 4.6.5. Then 
limt_c Va^J (t) ^ 0 and the piecewise smooth vector field 

= fortG[a,c], 

1^0 fortG(c, 6] 

satisfies ZlV..yy(c) = - limi^^ 0. Let J > 0 and W be a vector 

field along 7 which satisfies 

W{a) = W{b) = 0 , = 0 , (W(c), AVa^V{c)) > 0 . 

By the definition of J we have y) = 0 and I^’l (y, W) = 

—2r) {AV (c), W (c)). This in turn implies 
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= -2,J {2SV(c), W{c)) + W M')- 

Hence for S sufficiently small we obtain sign( ^ (y -}- 5 W^ V +<5W)) = 
sign(— 77J). Since we can replace W by —ly this equation implies that 
•^^’{7(6)} semi-definite. I 

In order to have an analogue of Theorem 4.6.1 for null geodesics we need 
to use the energy index form. 

Theorem 4,6.2. Let {M,g) he a Riemannian or a Lorentzian mani- 
fold, E be a Riemannian submanifold, and 7 : [a, 6] — > M be a space- 
like (Riemannian case) or causal (Lorentzian case) geodesic with 7(a) € 

(^7(a)^) • 

(i) The submanifold E does not have focal points along 7 if and only 

if 

£» '7 _L ‘ ' 

— the index form is positive semi- definite, 

~ ^ implies that there exists a function vq with 

V ~ uo7, ie.; V corresponds to a reparameterisation of 

(ii) The point 7(6) is the only focal point of E along 7 if and only if 

— the index form is semi-definite and 

“ there is a non-vanishing vector field V : [a,b] M along 7 which 
satisfies ^ values in 7'^. 

(Hi) There is a focal point 7(c) of E along 7 with c < b if and only 
if the index form semi-definite. 

Proof. If 7 is timelike (in the Lorentzian case) or spacelike (in the Rie- 
mannian case) then the proof is completely analogous to the proof of 
Theorem 4.6.1. We can therefore restrict to the case that 7 is a null 
geodesic and E is spacelike. 

(i): Again, there are Jacobi vector fields which satisfy assertions (a)- 

(c) in the proof of Theorem 4.6.1 (i). Since every Jacobi field satisfying 
(a)-(c) must be a linear combination of the Ji we can assume without 
loss of generality that Ji{t) — tj{t). Exactly as in the proof of Theorem 
4.6.1 we obtain for every vector field V along 7 with V (a) € Ty^a)E and 
V{b) = 0 

J (X 

where t ^ v^{t) are functions defined by V{t) ~ YhZi *^^(^)^i(f)- The 
equation {Ji,j) = 0 implies that each Ji must be spacelike or null, 
whence (^> > 0. Moreover, the Jacobi field Ji is spacelike for 
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i > 2, By the linear independence of the Jacobi fields at every point ^ a 
we have ^ unless = 0. Since V{b) = 0 

the equation y) = 0 implies that = • • • = ^ = 0 and 

therefore V = v^Ji which is parallel to 7. 

The proof of assertions (ii) and (iii) is completely analogous to the 
proof of Theorem 4.6.1 (ii), (iii). I 

We can now extend Corollary 4.6.4 to null geodesics in the case that one 
of the submanifolds U2 degenerates to a point. This is achieved by 
showing that there is a variation of 7 through timelike curves from the 
spacelike submanifold S to 7(6) if is not semi-definite. Observe 

that we cannot use the same argument as in Corollary 4.6.4. While we 
would obtain the existence of a variation / with £^(/(s, •)) < 0 for all s, 
it would not be clear that these varied curves are everywhere timelike. 

Lemma 4.6.15. Let {M,g) be a Lorentz manifold^ E be a spacelike sub- 
manifold, and 7: [a, 6] “> M be a null geodesics which intersects E or- 
thogonally. not semi-definite then there is a timelike curve 

from E to 7(6) arbitrarily close to 7. 

Proof. The strategy of proof is as follows. Theorem 4.6.2 implies that 
there is a first focal point 7(c) {c G {a, b)) of E along 7. For some small 
S G (0,6 — c) we will construct two vector fields ^{t) and A{t) along 
7|[a,c+<5] such that for every variation f of j with /s(0,t) = ^{t) and 

f 

V ds(0,t)f^ - {ft{'t,s)Ji{t,s)) < 0 for s > 0 sufficiently 

small. We will show that there is such a variation which, in addition, 
satisfies f(s,a) G E and f{s,c + S) = 7(0 + J) for all s. It is then possible 
to join E and j{c + ^) by a timelike curve arbitrarily close to 7. This 
curve can in turn be slightly deformed in order to arrive at a timelike 
curve from E to 7(6). 

Observe that A cannot be chosen completely independently of In 
fact, at 7(a) € Z" we have must have A-^ = 9^/^^ = ^ifs.fs) ~ 

I^oui the proof Lemma 4.6.7 we see that 

I (^ + (v a, V dsfs^f) 

= - (v V + -R(A- ft) ft’ /*) 

+ (V djs, fs^j + (V V dj^’ft) 
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holds. If we can construct vector fields A with 

(v^v^C + > 0 - - (v^^,7) > 0 

then any corresponding variation / satisfies {ft, ft) <0 for small s > 0. 

Let 7(c) (c € (a, 6)) be the first focal point of E along 7 and let J 
be a Jacobi field according to Definition 4.6.5. It follows from Lemma 
4.6.11 (ii) that this Jacobi field is everywhere orthogonal to 7. 

If there would be a point d € (a, c) with J (d) = aj(d) then the Jacobi 
field J(t) - would have a zero at d and satisfy all the conditions 

of Definition 4.6.5. Hence there would be a focal point 7(d) of E along 
7 before 7(c) in contradiction to the definition of c. We have therefore 
shown that J{t) € (7(t))-^ \®7{t) for all t € (a,c). 

The derivative of J at c satisfies V^J{c) € T^(c)M \ E7(c) since J 
is non-trivial and not parallel to 7. This implies that there is a J > 0 
such that c is the only point in (o, c + J] where J is tangent to 7. Hence 
there exists a spacelike vector field U along 7 with value in (7)-*- and a 
function (p : [a, 6] — > R such that 

- {U (t), U (t)) - 1 for all t £ [a, c + J], 

- J{t) = (p{t)U{t) for all t e [a,c + d], 

- ‘p{t) > 0 for all t e (a, c). 

- ip{t) < 0 for all t e {c,c + d). 

We will now construct ^ by slightly stretching U. Let tp: [a, c 4- J] -> R 
be a function and consider ^ = (p + tp)U. Prom 

7)7 = W^VAfiipU) + tpR{U, 7)7 

- ^17 + 2i,WAp + + R{U, 7)7) 

We get 

( v.y + R{t 7)7, = + + (^7^7^ + u)) . 

There is a number Ai > 0 such that -(Ai)^ < + R{U, 7)7, 17^ 

for all t 6 [a, c H- (5]. Let A2 > 0 and ipit) ~ A2(e'^^’^ -- Then we 

obtain 

^ + V>(V.yV.yC/ + R(C/,7)7,C/) 

= ((Ai )2 + (V^V^CI + R{U, 7)7, U)) + A 2 (Ai) 2 e^i“ 

> A2(Ai)2e^i“ > 0 

for all t € (a, c+5). We set A2 = -e-^i") which is pos- 

itive. Observe that ‘^{a) = 0, p{c + 5) + tp{c+5) = 0, and (p{t) + ip{t) > 0 
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for all t € (a, c] . We can assume that c-\-5 is the first zero after c since oth- 
erwise we could replace (5 by a smaller number. To summarise, the vector 
fields satisfies ^(a) = J(a), = 0, and -f > 0 

for alH € (a,c -f 5). We will now construct the acceleration vector field 

There is a basis ei, . . . , en-i of (7(^))*^ such that 

(i) en-^i = 7(<x) 

(ii) span{ei, . . . , 

(iii) {ei, 6k) = 5ik for alH E {1 , . . . , n 2 }, fc € { 1 , . . . , n - 1}. 

Let en 6 T^(^a)M b® fbe null vector with (cn, ^n-i) — -1 a.nd (e^, e^) = 0 
for alH € {1, . . . , n - 2}. We denote by e^(t) (^ € {1, . . . , n}) the parallel 
transport of along 7 (i 6 n}). There are numbers (k E 

{dim(i:) + 1, . . . ,n}) such that J(^(a),^(o)) = ELdim(r)+i Let 

fc=dim(i7)+l 



where 

rt<) = {im.UyM} + («..v^(„)£)) - ({.. v^(,)£). 

We have /x(a) = Ca), 7(^i)} = a,nd 

fi{c + 5) = 1 ^7(c + 5)^) ~ 2 ^ 7(0 + <5) 0 

=0 

= (y' + + ^)|c+5 = 0. 

Hence the vector field A satisfies A{a) = A{a)-^ = S{^a)ia) and A{c + 
5 ) = 0 . We will now show that (v^A, 7^ + (^7^’ calcu- 

late 



v^At) = 



n— 1 



\A!=dim(i;)+l 
I n~l 

\ V c + <5 ~ a 



c 8 — t 



^ \c + 5 — a 
c d — t 



C/c I ^ 'y ) ^n~l 



fl 






^a)j 7(n)) + . . 

(£.. v^«)<) 



and our claim follows from 
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= {J^{Ja,Ja),i{a)) + 2^7(a) (^>0 

= ( V d fs, ft) + <t>{a){i){a) + ^{a)) 

\ / |(tt,0) 

=^-[fs, + 4>{a){i’{a) + 0(o)) 

— -(p{a)<j){a) -f 4){a){4){a) + <p{a)) — (f){a)'ip{a) > 0. 

We will now show that there is a variation /: (— €,e) x [a,c + 5] — > M 
such that 

1- /s(0,t) = J(t), ^ ^ Kc + J], 

2. /(5,a) e for all s G (-e,e), 

3. /(5,c4- (5) = 7 (c + (5) for all a G (-e,e). 

By construction of ^ and A such a variation must satisfy {fuft) < 0 
for all s ^ 0. Hence we will obtain timelike curves from S to 7(0 + 5) 
which are arbitrarily close to 7 which in turn implies that there also exist 
timelike curves from E to 7(6). 

Let jjL\ (-e,e) be a curve in E with fi{Qi) = 7(a), /i(0) = ^(a), and 
V^(0)A = 0 where V is the Levi-Civita connection of E, Let / be a 

variation such that /(s, a) = fjb{$) and /(s, c + (5) = 7(0 + S) for all 
s G (-e,e). For e sufficiently small there is a smooth map V : (s,t) 1-^ 
y(s,t) G T^(t)M with exp^(i)(Z(5,i)) = where Z(0,t) = 

We will now modify this variation so that the modified variation has 
variation vector field ^ and acceleration vector field A, Restricting the 
equation dsexp{Z{s^t)) ~ Tz(s,t) to s = 0 implies that 

the variation vector field ^ of / is given by In order to cal- 
culate the acceleration vector field A let {or^, . . . . . . ,y^} be a 

coordinate system of TM such that y^{v) = for any vector v. For any 
given V G T^M let A be the geodesic given by A(s) — exp^(su). Then the 
calculation 

= ^ i^Wxisv)) + r;fcA^(s)A*=(s) 
implies (^) = follows that the equation 
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holds and completely analogously that the variation 

f{s,t) = exp^(t) (^Z{s,t) + s{^{t) - i{t)) + y (4(0 - 4(0)^ 

has variation vector field ^ and acceleration vector field A. By construc- 
tion, each of the curves t i— ^ /(s,t) starts in S and ends in 7(0 + 5 ), 



4.6.3 Existence of focal points 

The existence of focal points depends on three factors, the curvature of 
the pseudo-Riemannian manifold near 7, the length of 7, and the shape 
of the submanifold E, A typical result is the following. 

Proposition 4.6.1. Let E be a spacelike hypersurface and 7; [a, 6] ^ 

M be a geodesic with (7,7) — V ^ {— 1 , 1 } and 7(a) € If 

Ric(7(t),7(t)) > 0 for all t and the mean curvature vector field H of E 
satisfies (iJ^(^),7(a)) =: c > 0 , then there is a focal point of E along 7 
before j(a + 1/c), provided c> l/(b — a). 

Proof Let (ei, . . . , Sn-i} be a basis of Ei the parallel trans- 

lation of 6 i along 7. The vector field ^i{t) := {1 + ca - ct)Ei{t) vanishes 
at a + 1 /c and is tangent to Z* at t = a. Since 

^”1 ra+l/c 

= ^ Z + / (1 + ca - ctf (R(Ei,j)Ei,j) dt 

i=i 

- (j(a),I(Ei(a),Ei(a))}) 

= Tjun - l)c - / (1 + ca - ct)'^Ric{'y{t),j{t))dt 

-(7(a),(n-l)if^(„))) 

/*0+l/ C 

= — 7 ] {1 + ca - cit)^Ric(7(t),7(t))dt 

J a 

we have found a vector field ^ with negative index in the Riemannian 
and positive index in the Lorentzian case. If follows that 
is semi-definite or indefinite and therefore the existence of a focal point 
before 7(a + 1 /c) I 
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The inequality given in Proposition 4.6.1 is sharp. Consider a sphere 
of radius r and with inner normal n. Then its mean curvature vector 
field is given by ~ ~n. The geodesic 7 with 7(a) = Ux satisfies 
{Hx^ i{ci)) — ~ and intersects the centre of the sphere at 7(a-hr). Clearly, 
the centre is the first focal point of the sphere along 7. 

There is an analogue of Proposition 4.6,1 for null geodesics and space- 
like submanifolds of codimension 2. This analogue will become important 
in Chap. 9 on singularities in general relativity. 

Proposition 4.6.2. Let E he a spacelike submanifold of dimension n— 2 
and'y: [a,b] b be a null geodesic with 7(a) e If 

Ric(7(t),7(t)) > 0 

for all t and the mean curvature vector field H of E satisfies 

=: C > 0, 

then there is a focal point of E along 7 before j(a + 1/c); provided c > 
1/(6 -a). 

Proof. The proof is analogous to the proof of Proposition 4.6,1. We 
choose a basis {ei, . . . , 6^-2} of There are a spacelike unit vector 

and a timelike unit vector both orthogonal to E such that 
7(a) — e^_i + 6n- We denote the parallel translation of ejt along 7 by 
Ek . Then we have 

n-2 

Ric(7,7) = ^ + (R{En-i,j)if,En-i} - {R(En,j)j,E„} 

i~l 

n—2 

” ^ 7)75 + {R{En-l, En)En^ En~l) 

i=l 

— (R(En^ En~l)En~l) En) 

n~2 

= J^(R(Ei,j)j,Ei). 

i=l 

This implies 



E ^s,{yfa+l/c)}((^ + ca - (1 + ca - Ct)Ei) 



/ /'tt+l/c 

= XI ( *^ + / (1+ca-ctf (R(Ei,j)Ei, 7) dt 

i=i \ •'0- 



- {j{a),M{Ei{a),Ei{a))) 
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= (n - l)c - f 
J a 



a+l/c 



{l + ca- ct)^Ric(7(i),7(i))di 



a-j-l/c 






(1 + ca “ ct)^Ric(7(t),7(t))dt. 



Hence positive semi-definite and therefore there is a 

focal point before j{a + 1/c). I 



It is also possible to prove the existence of conjugate points of geodesics 7, 
if they are sufficiently long and if suitable curvature conditions hold along 
7 (cf. Proposition 4.6.3 below). This result is central to the singularity 
theorem which is presented in Chap. 9. Since in the case of a single 
geodesic we do not have initial conditions which guarantee focusing in 
one direction, the proof of Proposition 4.6.3 will be much more delicate 
than the proof of Propositions 4.6.1 and 4.6.2. The rest of this section 
will be devoted to proving the following proposition:^ 

Proposition 4*6*3. Let (M^g) be a Riemannian or Lorentzian mani- 
fold and ^ be a complete geodesic which is spacelike in the Riemannian 
and causal in the Lorentzian case. //Ric(7(t),7(t)) > 0 for all t and if 
there is a to such that the map 

■R: (7(^o))‘‘' (7(^0))’^ , Rv:= R{v,^)^ 

is not identically zero. Then 7 has a pair of conjugate points. 

The proof of this proposition requires more results from the theory of 
conjugate points. 

We have seen before that there is an (n — l)-dimensional subspace 
of Jacobi fields along 7 which have values in (7)-^ and vanish at a given 
point. In the case that 7 is a null geodesic, there is always a linear com- 
binations of these Jacobi fields which is equal to the trivial Jacobi field 
1 1— ^ (at+P)j{t). This uninteresting subspace disappears if one considers 
the factor space (7(t))^/R7(t) = {[v] : v € (7(t))^,^ w v — w || 
7(t)} instead of the orthogonal complement {'y{t))-^. This space coincides 
with the orthogonal complement if 7 is timelike or spacelike. If 7 is a null 
geodesic then working with the factor space has two advantages: As we 
have indicated above, [(ai -I- (3)j{t)] = [0]. More importantly, the metric 
g induces a metric [g] on (7(^))'^/7(t) which is positive definite instead 
of degenerate. 



® We follow closely the presentation in (Beem and Ehrlich 1981). There is 
also a much shorter proof in (Hawking and Ellis 1973) which I failed to 
understand. 
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Definition 4 . 6 . 6 . Let 7: [a,6] M be a geodesic and t G [a, 6]. Two 
vectors v^w € {'y{t))^ are called equivalent, v w, if there is a number 
a e R with v = w + a^yit). We denote the space of equivalence classes 
{[v] : V € (7(0)-^} by [ 7 (f)]^ and set [7]-^ = Ue[a, 6 ][ 7 (i)]'^- 

Amap[^](t): [7(f)]-^x---x[7(t)]-^x^[7(i)]-^^ x • • • x (^[7(i)]-^ j -4 
R along 7 which is linear in each of its entries is called a tensor class 
along 7. 

From the definition it is clear that any tensor ^4 of {'y{t))'^ induces a 
tensor class [A] at t via 

[^KK], • • • , bs], - Ivl) = ^('^1, .,¥>’■) 

where is defined by [¥>*]([«]) = ip\v) for all v e (In particular, 

(^^(7) = 0). Conversely, any tensor class is induced by a tensor in this 
way. 

The metric and the covariant derivative in direction 7 induce analo- 
gous objects for tensor classes. 

Lemma 4 . 6 . 16 . Let j be a geodesic and [A] be a tensor class along 7 
and A be any tensor field along 7 with [A] = [A], Then [A] VjA is 
well defined. 

If 'j is a null geodesic then the metric [g]: [7]-^ x [7]-^ — > R, [o,w] t— > 
bKH) M) •“ well defined and positive definite. 

The operator [R]: [7(t)]-^ ^ [ 7 ( 01 "^? M [■^('^^?7)7] defined. 

Proof For any one- form (p satisfying p{j) =0 we have 

= ^j{t) i'Pivi*) + mam - <p + mm) 

= 

- 'f {y^{t)yi*) + d/7(i)7(f) + /V^(i)7(i)) 

= ^^{t){v{y{t)) - V {^j(t)yii)) = ^j<p{y{t)), 

where we have used that V^7 = 0. For any vector field V along 7 we 
have [V^(V(t) + fmm = + d/(7)7 + /V^7] - 

Hence the first assertion holds for 1-forms, vector fields, and (trivially) 
for functions along 7. Since is a derivation this proves the first claim. 

The second claim follows since there is a basis ei, . . . , of (7(t))"^ 
such that g{ei, Cj) = 6 ij for i,j e {1, . . . , n - 2} and gien-ijCk) = 0 for 

k G — 1}, 

The third assertion is a consequence of R{j, 7)7 — 0. I 




4.6 Length and energy functionals 245 



Definition 4.6*7* A Jacobi tensor class is a tensor class [A]: [7]-^ — > 
[7]-^ along 7 for which [A] + [R][A] = [0] holds. 

Lemma 4.6.17. A tensor class [A]: [7]-^ ^ [7]-^ along j is a Jacobi 
tensor class if and only there is a tensor field A along 7 which induces 
[A] and has the property thatt i-*> AV(t) is a Jacobi field for every parallel 
vector field V with values in (7)"^. 

Proof Suppose that A is a tensor field along 7 such that AV e (7)-^ is a 
Jacobi field for any parallel vector field V with values in (7)"^. It follows 
immediately from 



(v^v^[^] + [i?][^]) y] 



■(v^V^a) T/] + [R{AV,m] 
V^^^{AV) + R{AV,m] 



that [A] is then a Jacobi tensor class. Conversely observe that there 
are exactly (2n — 2) linearly independent Jacobi vector fields Ji along 
7 which have values in (7)"^. Let {£’1, . . . , £n-i} be a parallel frame of 
(7)-^ and define the tensor field A: (7)-^ (7)-^ by Av ~ Alv^Jj 

X * * 

where v = ^J^d the A\ are (constant) real numbers. It is 

clear that for every parallel vector field V G (7)“^ the vector field AV 
is a Jacobi field with values in (7)"^. Hence [A] is a Jacobi tensor class. 
Since the differential equation [A] + [£][i4] = 0 implies that the space of 
Jacobi tensor fields is 4(n — 1)^- dimensional if 7 is timelike or spacelike 
and 4(n — 2) ^-dimensional if 7 is null every Jacobi tensor field can be 
generated by some tensor field A as constructed above. I 



It is clear from the proof of Lemma 4.6.17 that the columns of a Jacobi 
tensor class with respect to a parallel basis of (7(t))"^ are just Jacobi 
fields expressed in this basis. 

Corollary 4.6.6* Let 7: [a, 6] — ^ M be a geodesic and to,ti G 
[a, 6] {toAh)‘ 

(i) For any pair of tensor classes 

[^0] : mo)]"- - [TCio)]-" , [io] : [7(^0)]^ ^ [7(^0)]^ 

there is a unique Jacobi tensor class [A] with [-A](^o) = [^o] cb'fid 
[A] (to) = ^0; 

(ii) Assume that 7 does not have conjugate points and let 

[^0] : [7(^0)]^ - [7(^0)]^ , [Aa] :■ [7(<i)]^ - [7(^1)]^ 

be a given pair of tensor classes. Then there is a unique Jacobi 
tensor class [.4] with [^](fo) = [-4o] and [j 4 ](^i) = [Ai]. 
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Proof, The assertions follow immediately from Lemma 4.6.11 and Propo- 
sitions 2.9.2, 2.9.4. I 

The following lemma is clear from the definitions and the fact that a 
non- vanishing Jacobi field which is parallel to 7 has at most one zero: 

Lemma 4.6*18. Let j he a geodesic. Two points 7(c), 7(d) are conju- 
gate if and only if the Jacobi tensor class [A] which satisfies [^](c) == [0], 
[yl]](c) = id is singular at d. 

Definition 4.6.8. Let j be a geodesic and [B]{t): [7(f)]“^ ^ [7(^)]“^ be 
a tensor class along 7. Then the adjoint of [B] with respect to [g]^{t) 
denoted by [B]* . 

Lemma 4.6.19. Let [A] be a Jacobi tensor class along a geodesic 
7: [a, 6] — ^ M and assume that there is a number to G [a, 6] with 
[A] (to) = [0]. If [A] is non- singular at t then the tensor class [A][A]^^ is 
self-adjoint at all t. 

Proof Let V, W be parallel vector fields along 7 with values in The 
equations 

v^( (av, - (^a^AV, aw') ) 

= + (aV.^a^Va^Aw) 

- (Va^Va^AV,Aw') - (v^Ay, 

= {-R{AV,^)^,AW) - (Ay -iZ(APy 7)7) = 0 

and A(io) = 0 imply that (Aw, '^a^Aw) = (Va^Av, Aw) for all vectors 
v^w € (7)'^. It follows that 

(v^AA-^r;,u;) = (Va^A{A-\),A{A-^w)) 

= (A(A-iv),V^A(A-ity)^ 

= (v,S7a^AA~^w) . 



The self-adjoint tensor class [A][A]”^ has a direct geometrical inter- 
pretation in terms of congruences of geodesics. Let 7: [a, 6] — > M be 
a spacelike (in the Riemannian case) or timelike (if M is Lorentzian) 
geodesic^ and /: x R M, (s\ . . . , t) 1-^ /(5^-, . . . , s'^~^,t) 



^ The interpretation in for lightlike geodesics is slightly less direct. 
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be a smooth (n — 1) -parameter geodesic variation of 7. We may assume 
that each geodesic t f{$^^ . . . , satisfies {ft^ ft) € {-1,1} and 

that at t = a the vectors . . . ,/sn_i} are linearly independent. 

Then U = . . . , is a well defined vector field near 7(a). 

The covariant derivative of U geometric properties of our congruence of 
geodesics. The function 0 = div(C/) measures the divergence of neigh- 
bouring geodesics. Analogously, = dU^ is the infinitesimal rotation 
and cr, the traceless, symmetric part of V?7, the infinitesimal, volume 
preserving distortion of neighbouring geodesics. At (s^, . . . , = 0 

we can recover this information in terms of Jacobi classes. 

The Jacobi field satisfies 

(/., ft) = {Vfjs, ft) = {Vfjt, ft) = 0. 

Hence (/^^(O, . , . ,0,a),7(a)) = 0 implies (/^^(O, , . . ,0,t),7(t)) = 0 for all 
t € [a, 6]. Since we can replace the parameter t by ^ + h{s ^^ . . . , 
where h is an arbitrary function, we can always parameterise our geodesics 
such that fsk{ 0 y . . . ,0,i) G for all k and t. Let 

be a orthonormal basis of (7)-^ which is parallel along 7 and denote by 
A : (7)-^ ^ (7)“^ the tensor field along 7 that maps Ei into /^i (0, . . . , 0, t) . 
It follows that [A] is a Jacobi tensor class and that 

{V^A)A-^U = i'7^A)Et - V^(AEi) fjs^ = V fjt = 

Since the vector fields {/^i , . . . , /^n } span (7)-^ we conclude that VyU = 
{Va^A)A~^v for all v e 7-^. This motivates the following definition. 

Definition 4.6.9. Let [A] be a Jacobi tensor class along the geodesic 7. 
Then the expansion 9 of [A] is defined by 

0 = tv{[A][A]-^), 

the vorticity tensor u) of [A] by 

^it) = \ - ([^][A]-')*) , 

and its shear tensor a by 

\i([A][^]-^ + ([A][^]-i)*)-;^id */(7,7)=0. 

The following lemma implies that 9 {t) = tr([A][A]“^) diverges where A 
is singular. 
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Lemma 4,6.20. For any a Jacobi vector class [A] we have 

where det is any parallel determinant function. (In particular^ one can 
choose the determinant function induced by the metric [g]). 

Proof. Let r = n — 1 if 7 is spacelike or timelike and r = n — 2 if 7 is 
null. 

Since the space of parallel determinant functions along 7 is 1- dimen- 
sional, it is clear that the formula in the assertion is independent of the 
choice of det. 

Assume that [A]{to) is non-singular. There is a parallel linear tensor 
class [B] such that [A]{to)[B]{to) = id. Let [C] - [A][B] and 
be a parallel orthonormal basis of [7]^. We choose the determinant func- 
tion defined by det([D]) = Det(([D]^)^^/c), where Det is the standard de- 
terminant in R"' and [D][Ek] = [D]i[Ei\. At t = to we have [C][Ek] = [Ek] 
and Det([£^i], . . . , = 1. This implies 

(det[C])-|,^,„ = (det([C][£;i], . . . , 

r 

= ^ det([£;i], . . . , [C][Ei], . . . , [Er])\t=to 

i^l 

n—1 

= Y, [^\ det([£;i], . , . , [Ei], , [Er])\t=t„ 

"=tr([0])|t=to- 

Since [A] = we obtain therefore at t = to 

(det[A])- = (det[C]det[B]-^)- = (det[0])' det[S]-^ 

= tr([(7]) det[S]-^ = det[C]-^ det([0][S]-^) 

= tr([A][S]) det[C]-^ det[A] = tr([A][A]-i[C]) det[C]-^ det[A], 

At to we have [C] = id and therefore (det [A])' = tr([A][A]“^) det[A]. I 

Lemma 4.6.21 (Raychaudhury equation). Let (M, be a Lorentz- 
ian or Riemannian manifold and j be a causal geodesic if (M^g) is 
Lorentzian and a spacelike geodesic otherwise. If [A] is a Jacobi tensor 
class then its expansion satisfies 



-Ric( 7 , 7 ) - tr(w 2 ) _ g 

-Ric( 7 , 7 ) - tr(o)2) - tr(cr2) - if ( 7 , 7 ) = 0. 
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Proof, Let r = n — 1 if 7 is spacelike or timelike and r — n — 2 if 7 is null. 
Let be a parallel orthonormal frame of ( 7 )-^. Since [A] is a 

Jacobi class we have ([A][A]"^)‘ = + [^](“[A]“^[A][A]“^) — 

— [R] “ ([A][yl]“^)^ and therefore 

^=(tr([A][^]-^))'=tr(([A][^]-i)') 

= -tT{[R])-tv{[A][A]-^f 

= -'f^s {R{Ei, ■y)j, Ei) -tr ( ra; + (T+ ^id 

If r = n ~ 1 it is clear that g {R{Ei,'y)j, Ei) — -Ric( 7 , 7 ), If 7 
is null we can find a parallel spacelike vector field En-i and a timelike 
vector field En such that {Ei, .^En} are orthonormal and 7 = En-i + 
Eri‘ Then we have 

n 

Ric(7,7) = '^g{R{j,Ei)^,Ei) 

n~2 

= J29imEi)%Ei) 

g{R (En~l + En, En^l) {En-l + En),En~l) 

~~ g {R {En-l + Em En) {En^l + En),En) 

n~2 

~ ^ ^i) 9 (R (Rn, Rn~l) Rn,Rn-l) 

i=l 

~ 9 {R {Rn—l, Rn) Rn—l^ Rn) 

n—2 

= X! 5 {R{i, Ei)j, Ei) , 

where we have used the symmetries of R and the fact that 

n— 1 

tr(B) = ^ g{BEi, Ei) - g{BEn, En) 

for every linear map B. Hence in either case, r — n-1 or r = n — 2, we 
get 

/ 0 ^ 2 

0 = Ric(7, 7) ” tr ( + (7^ + -ir-id +” (a; + cr) + ct;cr + (jo; 

\ r 

By definition we have tr(o;) = tr(cr) = 0. For any tensor [B] we have 
+ mm - [B]*)) = trim - tr(([B]*) 2 ) + tr([S]*[B]) - 
tr([B][B]*). Since the definition of the trace implies 
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n— 1 



n—l 



tr([s]2) = Y^[g] {[Bmi m) = 5^(5] im, ([BrnEi]) 



i=l 



i^l 



=tr{{[Brr) 



and 



tr([B]*[B]) = tr([B]([5]*[B])[B]-^) = tr(B[B]*) 
we conclude that tr(uja) = tv{auj) = 0. 



Lemma 4.6.22. Let j be a timelike or spacelike geodesic and [A], [B] be 
Jacobi tensor classes along 7. Then the tensor class ([yl]*)*[5] - [-A]*[B] 
is parallel along 7. 

Proof. First observe that [R] is self-adjoint. In fact, 

b] H) = (■R(t'.7)7.t^) = {R{w,i)i,v) = [ 5 ] (H, [-R1M) 

for all vectors [t;], [«;] implies [ii] = [i?]*. Hence we obtain 

(([^]*)-[B] - [^]*[S])' == ([^]*)-[B] + ([A]*)-[B] - {{AYYIB] 

- [^]*[B] 

= ([;i])*[B] - [AriB] 

= -([B][^])*[B] + {A]*{R][B] 

= -[A]*[B]*[B] + [^]*[B][B] = 0. 



We are now in a position to prove Proposition 4.6.3 

Proof of Proposition Let 7: R M be a complete geodesic and 

r = n - 1 if 7 is spacelike or timelike and r = n - 2 if 7 is null. We 
choose to e R such that 7(to))7(^o) ^ 0. The symmetries of R 
imply then that the induced operator [R]: [7(^0)]“^ [t(^o)]'^, M 

[R(u,7(^o))7(^o)] does not vanish. We need to show that 7 has a pair 
of conjugate points. Let J± be the space of all Jacobi tensor classes [A] 

which satisfy a; = 0, [A](^o) = id, and tr([A](to))|o. 

We will first show that each [A] € J± satisfies det[A](^) = 0 for 
some t^to’ Suppose (without loss of generality) that [A] G J_, Since 
the shear a is self-adjoint, tr(o-^) > 0 and the Raychaudhury equation 
implies ^ If there is a ti > to with ^(ti) < 0 then an integration 

implies ^ -f for all ^ > ti. Since the right hand side 

vanishes for some t = t2 > t\ the expansion 9 {t) must diverge at ^2- 
Consequently, det([A]) vanishes at ^2- If there is not any ti > to with 
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6{ti) < 0 then the inequalities 6{to) = tr([A](to)) < 0 and 0 < 
imply 6{t) = 0 for all t > to- From the Raychaudhury equation we see 
that cr “ 0 and therefore also [A][A]”^ = 0 for all t > to. Because of 
— —[R] ~ {[A][A]~‘^)^ this would imply — 0 for all 

t > to in contradiction to our assumption on to- The proof for [A] € 
is completely analogous. 

For each i> to let [B^] the unique Jacobi tensor class which satisfies 

= 0 and [Bi](to) = id. 

Assume for the moment that we have proved the existence of a Jacobi 
tensor class [B] with [B](t) = lim£_^^[B£](t) anddet([B](ti)) 7^ 0 Vti > 
to. Since all Jacobi tensor classes [B^] have vanishing vorticity a? so has 
[B]. Moreover, [S](to) = id implies that the Jacobi tensor class [B] must 
lie in either J_ or J^. From the (still to be proven) fact that [B]{t) 
is non-singular for t > to we infer that [B] 6 \ J“. It follows that 

tr([B](to)) > 0 and therefore that there is a f > to with tr({S£](to)) > 0. 
Since [B£](to) = id this implies that the expansion 0[b^] of at to is 
strictly positive. From the inequality we obtain 

1 . 1 ^0 - t , „ 

■Z 7TT ^ ^ < ^0* 

Since 00 (t ^ — 00) this implies the existence of a t% which 

satisfies det([B£](ti)) = 0. Hence there is a non-vanishing, parallel vector 
field V such that 

V{t) € (7(t))^ for all t and B^V{ti) = 0. 

Since the non-trivial Jacobi vector field defined by J := B^V vanishes at 
both ti and t our geodesic 7 has a pair of conjugate points. 

We still have to show that [B] does exist and that [B]{t) is non- 
singular for all t > to ^ In order to do so we will first obtain a formula for 
[B^] which depends only on a single, given Jacobi tensor class. 

Let [A] be the Jacobi tensor class which is uniquely determined by 
[A] (to) = 0 and [A] (to) = id. This tensor class is non-singular for all 
t > to since [A] (to) = 0 and 7|[to,oo) does not have conjugate points. 
Let be a parallel orthonormal frame of [7]-^ along 7. With 

respect to this frame we define a tensor class [(7] by 

([C]M)* = [A]^it) (s)dsH^ 

Observe that this definition is independent of the chosen frame since any 
two parallel orthonormal frames are related by a constant orthonormal 
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matrix D and since such matrices satisfy D*D = id. We will show below 
that \C] = [S^]. 

But first we need to check that [C] is a Jacobi tensor class. 

([c-jcoM)* = f (([^]*[^])-')l ( 5 )ds[u]*= 

J t 

= [AimiA]-y^i[c][v]y - (([ ad -^ u ])^ 

and therefore 

+ [A][>i]-i ([A][A]-Ac']-([^]*)-i)h 

+ ([A]*)-AA]*([A]*)-ih 

= [A][A]-1[c?]H + ([A][A]-i - ([A][^]-^)*)([^]*)-iH 
= [A]{^]-^[c-]fy], 

where in the last equation we have used that [A][A]”^ is self adjoint by 
Lemma 4.6.19. Hence we get 

[C\ + [iJ][C] = [A][A]-AC] + [R]{A][A]-^C] 

= ([A] + [J?][^])[A]-1[C'] = 0 

and [C] is indeed a Jacobi tensor class. 

Now we show [C] = [B^]. Since [C]{i) ~ 0 = [S£](t) the equality 
[B^] = [C] follows once we have shown [C](t) = [^£](t). Lemma 4.6.22 
and (([A]*)-[B£] - [A]*[4 '])|jo = id imply 

a^]*)-[B,-]-[A]*[4-] = id 

at all t € [a, 6]. In particular, we get id = -[^]*(£)[Bj-](£) since — 

0 . This in turn is equivalent to [Bf](t) = — ([ 4 l]*)“^(t)). On the other 
hand, [C](t^ = 0_ implies [C'](t) = [A](t)[A]-^(£)[C](t) - = 

— ([.< 4 ]*)“^(t) = This completes the proof of [C] = [B^j. 

We can now employ the formula 

m)i = \m) [ m*[A)-%{s)As 

in order to show that [B] = lim£_^^[B£] exists if for some a < to the 
geodesic segment 7 : [a, 00) — > M does not have any conjugate points. We 
will prove this by showing that both lim£_^oo[B£](to) and limj-_oQ[.B(-](fo) 
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exist (cf. Corollary 4.6.6). The existence of the first limit is trivial since 
[jB|](to) = id for all t. By Lemma 4.6.19 and [B^Kto) = id it follows that 
([^d)*(^o) = [Bi\{to) for all t. Polarisation implies then that [B£](to) 
is uniquely determined by the numbers [g] ^[B£][t;] Jt;] j where [u] 6 

[ 7 (^ 0 )]'^- We will now show that [g] [^;]^ have a well defined 

limit for every [u] e [ 7 (^o)]‘^« Since we use an (orthonormal) parallel 
frame along 7 and [A](;^o) = id we have 

[ 5 ] ([4-](<o)M, M) = SalAf^ito) (([A]*[A])-i); 

-mwr\t)[v\M) 

and therefore for alH+ > t_ 

b](l4%](^o)H, N) - [p] h) 

= I'^g] (([A]*[^])-^s)M^[^fe](5), [vYmis)) ds. 

The last expression is non-negative since 

= [5]([A]([A]*[^])-H^;UA]([A]*[>i])-n^]) 

and [^] is positive definite. Hence the function i [g] ^[H^-](to)Mj M) is 
monotonically increasing for every [ 1 ;] G [ 7 (j^o)]'^- We will now show that 
[p] ([■Bt](<o)M,M) < [p] ([-Ba(<o)]H,H) for all H € [7(io)]-^- This will 
give an upper bound for the monotonically increasing function 

t ^ [fit] ([4-](^o)H, m) 



thereby ensuring that the limit exists. Theorem 4.6.2 implies that the 
index form positive definite in all the cases we consider. 

Hence applying the piecewise smooth Jacobi vector field 



J(t) = 






for t € [a,fo]j 
for t e {to, i]. 
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= ( (v^J, v^j) + {R{J, 7) J,7) )dt 

= - {Bi(to)v,Bi{to)v^ + (^Ba{to)v,Ba{to)v'^ 

= -b) ([4-](io)M, [Bf](fo)H) + b] ([-Ba(io)]M, [S.](to)b]) 

= -b] ([4 'K^o)H, m) + b] ([Ba](to)H, h) 

which imphes the desired bound. Let [S°]: (7(to)"‘‘ d'C^o)"'’ be the 

unique selfadjoint tensor class defined by 

b] {[B%], H) = .lim b] ([4-](to)M, M) . 

Then [-B£](to) [5^] and, consequently, [B^] [B]^ where [B] is the 

Jacobi tensor class defined by [i5](to) = id, [S](to) = This proves 
the existence of [B] , 

We have still to show that [B](t) is non-singular for t > to. From our 
construction it is clear that [B] is given by 

POO 

[Bm = [A]){t) (([A]*[A])-I)^,(s)ds. 

Let [u] e [ 7 ( 0 ] \ {0} and V be the parallel vector field along 7 with 

V (t) = V. Then 

bl([^]-M-B]N,M) 

/ oo 

bH([A]*[>i])-MF](s),[y](s))ds 

/ oo 

[p] ([A]([^]*[yl])-i[y](s), [A]([Ar[A])-i[l/](s)) ds 

> 0 



implies that the operator ([A] ^[J5])(t) is is non-singular. Thus B is the 
composition of non-singular operators and therefore also non-singular. 
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Einstein^s equation is of the form Vg = T, where V is an operator 
acting on the Lorentzian metric g and T an expression which describes 
the distribution of matter in the universe. In Sect. 5. 1 we motivate 
that T should be a symmetric ( 2 ) tensor field which is divergence- free, 
and in Sect. 5.3 we find an expression for Vg. 



p. 210 i 
[i P> 270] 



5.1 Matter 

In Chaps. 1 and 3 we did not explicitly consider gravity. However, one of 
the main insights of Einstein was that gravity and the geometry of space- 
time are closely linked. His argument is very simple and runs roughly as 
follows. 

The movement of a particle which is subjected to a fixed external 
“force field” depends on its initial location, its initial velocity, its mass, 
and its charge (i.e. its “sensitivity” to the force field). For instance, a 
particle in an electric field which is initially at rest will move to one 
side if it is positively charged, to the opposite side if it is negatively 
charged and not at all if it is neutral. To be more concrete, consider a 
reference frame (r, t) in a Galilei spacetime and suppose that there is a 
non-relativistic particle (m, 7 ) which is located in an electric field € and 
has the electric charge e. Then the formula 

m 7 = e • €(t, 7 ) 

holds. Similarly, let (S be a gravitational field and g be the “gravitational 
charge”^ of the particle. Then 

= (5,1.1) 

holds. It is an experimental fact that the quotient ^ depends on the 
particle whereas the analogous quotient ^ is a universal constant and 
can be set = 1 (Eotvos 1896). Einstein concluded that this fact is not a 
mere coincidence but reveals that gravitation is an acceleration (rather 
than a force) and therefore something geometrical. He therefore replaced 

^ It is usually called the passive gravitational mass. 
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the equation 7 = © by the geodesic equation V^7 === 0 and the “force 
field” © by the connection V of spacetime. This point of view leads 
to a physical interpretation of inertial observers: They are simply those 
observers which are freely falling. 

It is an experimental fact that the matter distribution^ spacetime 
determines gravity. Hence we have to look for an equation of the form 

Vg - T, (5.1.2) 

where (M,p) is an n-dimensional Lorentzian manifold^, V is some kind 
of operator acting on the metric g, and T contains the information on 
the matter distribution. The “correct” form of T cannot be derived. First 
of all, it is beyond doubt that matter cannot be described by a smooth 
object in spacetime but instead demands a quantum description. This 
implies that we can hardly expect a description from fundamental, phys- 
ically suggestive principles. T will therefore be a classical approximation, 
i.e. something phenomenological. Consequently, our final form for Equa- 
tion (5.1.2), Equation (5.3.11) will appear to be grounded less firmly than 
the spacetime structure. However, the reader should recall that in the 
derivation of the Lorentzian structure of spacetime we already assumed 
that light can be described in an entirely classical (i.e. non-quantum) 
way. 

The only matter models we had considered so far where special- 
relativistic point particles (cf. p. 44) which admit a straight-forward 
generalisation. 

Definition A particle is a pair (m, 7 ), where m>0 is the mass 

of the particle and j is a curve in M with g{'y{t)^j{t)) = ~1 for all 
t 6 M, representing the history of the particle. 

Exactly as in the special-relativistic analogy, an infinitesimal observer 
V a.t X = 7(0) measures the energy = -mg{^{0)jv) and the spatial 
momentum 7(0)"^ = 7(0) - ^v. The following simple observation will 
serve as a guidance for defining T. 

Lemma 5.1.1. Let x ~ 7(^0) ^ M and {wi , . . ,jWn} be n linearly in- 
dependent timelike vectors with {wi^Wi) ~ —1. Then m'j{to) and m are 
determined by the numbers , . . . , . 

Proof, Since {g{wi^ -)}i=i,..Mn Is a basis of T*M, m'y is uniquely deter- 
mined by Eyj^ , . . . , Ewr, and m can be calculated from = g{m'y^ mj). 

I 

^ Here we use the term “matter” in a rather wide sense encompassing all forms 
of energy. This is motivated by the special- relativistic equation E = mc^ 
(c = 1: velocity of light) which asserts that (rest) mass is simply a form of 
energy (cf. Sect. 1.4.3). 

^ The spacetime we live in appears to be a 4-dimensional Lorentzian manifold. 
However, in this book we will not specialise to n = 4. 
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In other words, we only need to know the energy function 

E\ “[i/ € TxM I “ — 1} — ^ ^ ^ 

in order to recover the complete information about a single particle. 

Since is a smooth object, we would expect T to be smooth also. 
This indicates that point particles which are not depending smoothly on 
the coordinates of M cannot be used to constitute T. The simplest way 
to obtain a smooth matter distribution from a collection of particles is 
to consider averages instead of individual particles. 

Definition 5.1*2. A congruence of particles is a pair (e, C/), where 
e: M R is a function and U is a future directed vector field with 
g{U,U) = -l. 



The integral curves of U are identified with the world lines of the particles 
and the energy density function e with the energy density, measured by 
comoving observers. To keep the presentation simple we will restrict to a 
special case and assume that dU^ AU^ =0, i.e. that there exists locally a 
spacelike hypersurface E which is orthogonal to U (cf. Theorem 2.5.4). 
If B C X* is a compact region then an observer flowing with the particles 
measures for the energy of those particles which pass through B the 
quantity 

E~ tjJLE) 

Jb 

where (JLe is the induced volume form (cf. Definition 4.2.2^). Since a 
single observers must be identified with a timelike curve rather than a 
congruence of curves this expression should be understood as an approx- 
imation for small B. It is clear that we recover the definition of a point 
particle if the compact set supp(e) nX shrinks to a point and the energy 
density e increases adequately. 

A different congruence of observers, represented by a vector field V 
with giy^V) = — 1 and dV'^ A 1/^ — 0, will measure a different energy 
content, 



where 



Ey= eyfiEvi 

JBy 



“ Xy is a spacelike hypersurface orthogonal to F, 

— jiEv volume form induced on V, 

- By = {x e Xy : 3 a particle through x which intersects X}, .and 



^ Readers who have not read Sect. 2.5.4 may wish to do so now. Alternatively, 
they may (for the time being) refer to the footnote in Definition 4.2.2. In 
the following we will make use of calculus for differential forms (Sect. 2.5) 
in order to avoid clumsy but straightforward calculations. 
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- ey is a function which depends on the congruence of particles (e, U) 
and the obser^/er field V. 

We will now motivate a transformation law c i~4 ey through comparison 
with special relativity 



Fig. 5.1.1. A localised 
congruence 

Consider Minkowski space ( , rj) and the inertial observer field 

V: X ^ V 

where t? is a vector with r]{v,v) = -1. Assume also that the congruence is 
localised, i.e. restricted to any spacelike hypersurface ey has compact 
support (cf. Fig. 5.1.1) The inertial (or freely falling) observers t x^tv 
with common rest space x measure the energy 

Ey / Cy / €y (F 

where /iA«- is the volume form of rj. We assume now that the congruence 
of particles is freely falling, i.e. the field U satisfies the geodesic equation 
VjjU = 0. In the limit that supp(ey) n E shrinks to a point we would 
recover the energy associated with a single freely falling particle. 

Since ~ 0 we can choose linear coordinates . . ,x^“^) 

such that 

u^do, v = - ^ (ao + ||v||^i). 

\/i - ll^ll 

This gives 

Ey = f ev , ==^go + lyil^i) -IM t; 

Jx+v^ yi-lli7|l ^ ^ 
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= f ey . — L= A ■ ■ » A dx'^ ^ 

Jx+v^ 7i-||y|r 

- ||y||dx° Ada;^--- Adx"-^) 

== .1.... - = j eydx^ A ‘ • A dx'^'^ 

^l_||yf -/x+V- 



mi 



j 



eydx^ A • • • A da:' 



n— 1 



yiTjyf 

= _i=r%w- "ELfAvdxO 

\/i-iivim"- vi-mm"- 



where ey(x^^x^) = / eyda;^ A • ♦ • A da:^ ^ and a:^ denote the maximal 
(minimal) values of x'^ restricted to the support of ey in a: + V-^. In this 
hyper plane, we have = \\V\\x^ and therefore 



Ev = \/l-|ivf ?vdx^ 

Let 7 : t H 4 (t,a:^, . . . and let (m,7) be the corresponding freely 

falling particle with rest mass m. Then its energy — measured in its own 
rest frame — is £0 = “ ii.raj) - ”m(5o,9o) = rn. The energy mea- 
sured by the infinitesimal observer v ~ - ||VP^ (9o 4- l|i^||9i), 

is given by 



Ey = ~ {V, 7717 ) = ' r ' '.. = 



Eq 



An analogous relationship should also hold for our smooth congruence 
U since this congruence can be used to smoothly model a point particle. 
Hence we should have 

1 /'®+ 1 
= ■ = / eda: . 

Vi-i|yf 

Since x 1 —^ cy (x) was arbitrary this equation implies ey = - "^^is 

transformation law indicates that ey depends quadratically on V, 



Postulate 5.1.1 (Tensorial character of energy momentum). 

The map T is a symmetric (^’tensor field and the energy density 
measured by an infinitesimal observer v is given by T{v,v), 
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In the special case of our congruence of (non-interacting) particles there 
is a simple, well defined tensor field Tu , namely 



Tu = tU^®U\ 



Observe that ey = Tu{V,V) is in accordance with the transformation 
law derived above. 

The following lemma indicates that it is enough to know all possible 
energy densities in order to recover the tensor T (cf. Lemma 5.1.1). 

Lemma 5*1*2. Let T be a symmetric ( 2 )“fensor. Then T is uniquely 
determined by the values T(u,u) for all vectors u with g{u^u) = —1. 

Proof . Let T, 5 be two symmetric tensors with T(u, u) ~ S{u, u) for 
all u with (u , u) = — 1 and l et u be a tim elike vect or. Then the vector 
v/V- (^>^>) satisfies g{v/-\/^ {v,v,),v/ = -1 which yields 
T{v,v) = S{v,v). Since the space of all timelike vectors is open there is 
for every vector ty a 5 > 0 such that u + tty is timelike for all < € [-5, J]. 
Hence we obtain 



T{w, w) = ^ + tw,v + fty)) 

1 f d^ \ 

= + + =S{w,w) 

and the claim follows from the polarisation identity. I 

Conservation of energy and momentum is another fundamental prop- 
erty of matter which we wish to encode in our theory. We will find an 
infinitesimal formulation which (in special cases) recovers conservation of 
momentum (cf. Equation (1.4.12)). In Sects, and 1.2.1 and 1.4.3 we have 
simply stated conservation of momentum. These conservation laws can 
actually be derived within the theory of point particle mechanics. This 
is the content of the Noether Theorem which is covered in textbooks on 
mechanics. The main non-mechanical input for the Noether theorem is 
the Galilei group (in the non-relativistic case) and the Poincare group 
(in the relativistic case). Recall that the Poincare group is the set of all 
isometries of Minkowski spacetime. In order to find an infinitesimal for- 
mulation of conservation of momentum we will therefore have to employ 
Killing vector fields which can be regarded as infinitesimal analogues of 
1-parameter groups of isometries. 

Lemma 5.1.3. Let T be a symmetric (^-tensor with div(T) = 0 and ^ 
be a Killing field. Then div (T(^, *)^^) = 0. 

Proof, since T is divergence-free we have 
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div(T(^, .)«) = Va (T“*’6) = (VaT'*'’) 6 + (Va6) = {Va6) . 

Now the symmetry of T and the anti-symmetry of (cf. Lemma 4.5.2) 
imply that the second summand also vanishes. I 



Let be a foliation of M into spacelike hypersurfaces with future 

normals nt. A world tube with respect to is an open subset W of 

M with piecewise smooth > timelike boundary such that the intersection 
WC\St is connected for all t. If W is a world tube with respect to {St}teu 
then we denote the^ subset ^ by yVti,t 2 the part of 

the boundary which is not contained in Un U St 2 by Wtime* 

Corollary 5.1.1. Let T be a symmetric (^-tensor field with div(T) — 0 
and ^ be a Killing field. Let ti <t 2 and W be a world tube with respect to 
that snpp{T)f]yVtime ~ 0' Then the following conservation 

law holds. 



I J/iM- 

J St.2C^Wti,t2 d Et^nWti,t2 



Proof We have 



for any (n — l)-tuple of vector fields tangent to St. Hence pulled back to 
^ti /2 we get (nt,T(^, -)••) nj J/^m = ~T{C, •)“ -IMm- This (and supp(T)n 
Wtime = 0) imply 

/ («t)T(^, = / (nt2,r(^, •)■') nti J/^M 

Jaw JEt2r\\Vt^,t2 

-f («ti)T(^, •)**) *^f2 -IMm, 

J nW/,j ,t2 

where we have used that the future and past boundaries St 2 and St-i_ have 
opposite induced orientations. The assertion follows now from the Theo- 
rem of Stokes 2.5.5 since d((nt 5 T(^, -)^) (ut J/xm)) = “div(T(^, = 

0.5 I 

Hence the quantity •)^) (n^ J^um) is independent of the time 

parameter t defined by the foliation if div(T) = 0. We will now identify 



5 Readers who have skipped Sect. 2.5 may instead apply the integral theorem 
of GauB. 
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this quantity with a component of special-relativistic momentum in the 
context of Sect. 1.4.3. 

We will assume that spacetime is isometric to Minkowski spacetime 
before a time ti represented by a spacelike hypersurface and after 
a time t 2 represented by a spacelike hypersurface St,. We will study a 
matter model which consists of k freely falling congruences of particles 
in the region before Un and after St,. In between these hypersurfaces 
interactions or collisions may take place. Hence in this region we will (at 
this point) neither make an assumption on the matter model nor on the 
metric. 

To be concrete, consider the set A^, a point o € A^, and a non- 
vanishing constant 1 -form r. This 1 -form defines a foliation of A with 
affine hypersurfaces = {x € A : 7t(x— o) — t}. Let ?7 be a Minkowski 
metric such that Et are spacelike hypersurfaces and let V be the time- 
like, future directed constant vector field which is orthogonal to all Et 
and satisfies rj{V,V) = — 1 . Assume that the spacetime (M,^) satisfies 

M A and 9\{x€M: 7r(a;)0[ti,t2]} ” V\{xEM: 7r(a;)^[£l,t2]} • 
be the congruences of particles defined at all points x with tt{x) ^ 
(^ 1 ,^ 2 )- Assume that VjjXJi = 0 and that the energy densities sat- 
isfy supp(e^) n {x € M : 7t(x) G {^1,^2}} is compact. 

Let T be a symmetric ( 2 ) tensor field with div(T) = 0 and 

k 

Tx = ® {Uit for 7 t(x) i {h,t2) 

i—1 

and assume that (M^g) admits a Killing vector field Corollary 5 . 1.1 
implies that 

f (F,,T(^,.)“> vj/.A«= / 

Conversely, it is clear that div(T(^)) = 0 must hold if the integral equal- 
ity is valid for all such particle flows. 

Since the vector fields Ui are constant for x e Et {t ^ [^ 15 ^ 2 ]) we 
get div(T) = — 0- If vector fields Ui are at each 

point linearly independent then this equation implies dei{Ui) — 0,i.e. the 
energy density of each particle flow is constant along its flow lines. In the 
following we will assume that this is also the case if these vector fields 
are (pointwise) linearly dependent. Then the in-going and out-going rest 
masses ( 771^)2 defined by 

(^i)a = / CiUi 

where (zi)a is chosen such that 
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- supp(ei) n{x e M : 7r{x) € [^ 1 ,^ 2 ]} = 0 a.nd 

- 7r((2i)i) < h, 7i’((^i)2) > h 
are well defined constants. 




Fig. 5.1.2. Transforma- 
tion of the mass density in 
special relativity 



Assume now that the Killing vector field ^ is constant for t > t 2 y 
t < ti, and denote these constant vector fields by For each vec- 

tor field Ui and a € {1)2} there are vectors {ei)a with {V,{ei)a) = 0, 
and 

(^+ll^ill(ei)a)- 

A -11^# 

The integrals in the formula above reduce to 



k 

[ {V, T(^, •)“) J ^ (V Ui) {U, ^a) V J MA- 

^ta i~l 

k 



: V J /iA'"' 






where we have taken the length contraction into account (cf. Fig. 5.1.2). 
Hence we recover conservation of special-relativistic momentum. This 
motivates to demand the second matter postulate 
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Postulate 5*1.2 (Infinitesimal conservation law). The tensor field 
T has vanishing divergence^ div(T) = 0. 

Postulate 5.1.2 is interpreted as an infinitesimal formulation of conser- 
vation of energy and momentum. That these quantities are conserved is 
intuitively clear from the absence of a perpetuum mobile. However, the 
infinitesimal formulation implies a true conservation law only if space- 
time is endowed with a Killing vector field. In general, this is not the 
case. It follows that conservation of energy can only hold infinitesimally. 

Definition 5.1.3. A symmetric ( 2 ) -tensor field with div(T) ~0is called 
an energy momentum tensor. It is sometimes called stress energy mo- 
mentum tensor or stress energy tensor. 



5.2 Some specific matter models 

If T and g are simultaneously diagonalisable, 





/-I 0 


... ON 




(e 0 


... 0 \ 




9 = 


0 1 : 

: 0 


,T^ 


0 Pi ■ • . ; 

: 0 


, (5.2.3) 




V 0 ... 


0 1 j 




Vo ... 


0 Pn-1 / 





then € is interpreted as the energy density with respect to the flow of 
matter and Pi (i £ {1, . . . , n - 1}) are interpreted as principal pressures. 
To motivate this interpretation we will simplify to a perfect fluid, i.e. a 
matter distribution for which all principal pressures are equal. 

5.2.1 The perfect fluid 

Definition 5.2.1. Let e^p: M R be smooth functions and U be a 
vector field with g(U^ U) = —1, Then 

T={e+p)U'’®U^+pg 

is called the energy momentum tensor of a perfect fluid. A perfect fluid 
with p = 0 is called dust. 

Observe that the energy momentum tensor T = eU^ O considered in 
the motivation of Postulates 5.1.1 and 5.1.2 describes dust. 

Lemma 5.2.1. Let T be the energy momentum tensor of a perfect fluid. 
Then div(T) = 0 is equivalent to 

de{U) = -{e + 2?)div(t/), {e+p)VjjU = -7r[;xgrad(p), 

where denotes the projection to the orthogonal complement ofU. 
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Proof, It is straightforward to calculate 
(divT)a 

= g^\dc{e + p)UaUb + (e + p)((VcUa)Ub + Ua(VcUb)) + d^pgab) 

= A{e+p){U)Ua + (e + p)(Vj/[/)a + (e + p)div{U)Ua + grad(p)a 
= (de({7) + (e + p)d\vU)Ua 
+ (€ +p)(Vf/C/). + (([/' 5)(grad(p), 

The assertion follows since 1 U and [U^ g) is the metric 

projected to U'^ . I 

The vector field U is the velocity of the fluid particles and e the en- 
ergy a comoving observer would measure. If the divergence of this vector 
field is negative then the particles are getting closer together and conse- 
quently the energy density should increase. This is expressed in the first 
equation. The second equation states that the spatial acceleration of the 
fluid particles is proportional to grad(p). This indicates that p should be 
interpreted as the pressure exerted on the fluid. 

Perfect fluids are phenomenological models and the equations im- 
plied by div(T) = 0 tend to develop shock waves. It is therefore often 
argued that perfect fluids exhibit properties which are not shared by real 
matter. However, perfect fluids are prevalent in cosmological models of 
the universe. 

5.2.2 The collisionless gas 

An attempt to arrive at a more realistic matter model is to consider a 
relativistic gas. The idea is that we do not have a congruence of par- 
ticles but that each individual particle can move in any direction. The 
energy momentum tensor is then obtained by averaging over all particle 
velocities. Let be a coordinate system of M and choose 

the canonical coordinates • • -Pn-i) of T*M which are 

defined by a = pa{a)dx'^ for every 1-form a € T^M. A relativistic gas 
can be described by an energy momentum tensor 

Tah{^) = / PaPbf{x,p) (- det((pc^^)c,d=o,...,rz-i))”^ dpi A • • • A dpn, 
Jp+{x) 

where P'^(.'r) C T*M denotes the set of future causal 1-forms and 

/: p-^(x) 

is a density function. We assume that for \pa\ oo the function f{x, ♦) is 
sufficiently rapidly decreasing so that the integral is well defined. Observe 
that the n-form 
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(- det((5cd)c,rf=o,...,n-i)) " dpi A • • • A dp„ 

does not depend on the choice of coordinates (x^, . . . The rela- 

tivistic gas is collisionless if the Liouville equation c1/(Xh) = 0 holds, 
where 

= 9°'''Padxi> - ^dx^9°’^PaPbdp^.^ 

Using a system of normal coordinates it is easy to see that df(XH) = 0 
implies div(T) = 0. 

If [/ is a vector field and / was replaced by the delta distribution 
5{^U^ — one would obtain dust. Hence dust may be viewed as a 
“gas” whose molecules are all aligned and move into a preferred direction 
determined by the vector field U, 

Analogously, a relativistic photon gas is given by an energy momen- 
tum tensor of the form 

Tab{x)^ PaPbfo{x,p)l^p+M, 

where Pq (x) C T*M denotes the submanifold of non-va nishing future 
null 1-forms at x, ^'p+(a.) is a non- vanishing, oriented n- 1-form on Pq (a^)i 

and fo: PqW K"*" is the photon density function with respect to 
^P+{x)' We assume that for [pa| oo the function fo is sufficiently 
rapidly decreasing so that the integral is well defined. 

The following lemma implies that the energy density associated with 
a relativistic gas is always positive. 

Lemma 5. 2. 2* Let T be the energy momentum tensor of a relativistic 
gas (respectively, photon gas) with / > 0 (respectively, fo > 0), Then 
T{u,u) > 0 for all timelike vectors u unless the density function f (re- 
spectively, fo) vanishes. 

Proof, This is clear since for each vector u the integrand in the definition 
for T{u, u) is positive unless / (respectively, fo) vanishes. I 

Lemma 5.2.3. Let T be the energy momentum tensor of a photon gas. 
Then tr(T) — 0. 

Proof, The assertion follows from 

tr(T) = / PaPbfo{x,p) = f 9“(P.P)/o(x,p) = 0 

~'PQ 

since the 1-forms p are null. I 



® Readers who have knowledge of mechanics will notice that Ah is just the 
Hamilton vector field to the Hamiltonian function H{x,p) ~ \g°'^PaPb- The 
equation d/(AH) — 0 expresses then conservation of mechanical energy (cf. 
(Ehlers 1973) for details). 
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5*2.3 The electromagnetic field 

An electromagnetic field can be described by a 2-form F which satisfies 
MaxweWs equations^ 



dF - 0, (5.2.4) 

div(F) = J, (5.2.5) 

where J is interpreted as an electromagnetic current one form. The first 
equation can be geometrically explained within gauge theory (A small 
volume which contains the essentials of gauge theory is (Bleecker IQB!)*^). 
The second equation does not have any content without a prior inter- 
pretation of J. For our purposes it is sufficient to note that J is linked 
to other forms of matter. 

Remark 5,2 J. Using the Hodge star operator we can write ★d^ F = J 
instead of div(F) = J. 

If there is no interaction between electromagnetism and the other matter 
fields, i.e. if matter is neutral, then we have the set of equations 

dF - 0, (5.2.6) 

div(F) - 0. (5.2.7) 

These equations are called the source-free Maxwell equations. 

The electromagnetic part of the energy momentum tensor is given by 

(TeOab = ^ ~ \ {F, F)gab^ . (5.2.8) 

We will sketch in Sect. 5.3.1 below how one may justify these formulas. 

There are many mathematical texts on “gauge theory” which are very mis- 
leading. For “mathematical convenience” (or lack of physical knowledge) 
the Lorentzian metric of spacetime is replaced by a Riemannian metric. 
This leads to equations which are of a very different nature from those 
which describe physics. Only in very special cases (a prerequisite is that all 
functions are analytic) is it possible to convert results of the Riemannian 
theory to the Lorentzian theory using an analytic extension argument ac- 
cording to which one can “rotate” a Riemannian theory into a corresponding 
Lorentzian theory, where both theories are embedded in a complex theory. 
In the literature on quantum field theory this rotation is known as the Wick 
rotation. The Riemannian analogue of gauge theory is mathematically (but 
not necessarily physically) of interest because it is linked to the well devel- 
oped theory of elliptic partial differential equations. Gauge theory, on the 
other hand, is linked to hyperbolic partial differential equations. To sell the 
Riemannian analogue as gauge theory has presumably the advantage that 
it makes it easier to get funds for research in pure mathematics. On the 
other hand, it does confuse people. A pure Mathematician who worked in a 
field closely related to this Riemannian analogue and who saw work using 
the Lorentzian metric instead of a Riemannian metric once even asked me 
whether this Lorentzian approach would also be useful to physics! 
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Lemma 5*2*4* Let F be a closed 2-form and assume that Tqi is given 
by Equation 5.2,8. Then we have div{Tei) = ^div(F^)) = F(-, J^) 

Proof. Since dF = 0 we have VaF^c 4- ^bF'ca + ^cFab — 0 which implies 
iF“<=V6Fco = -F“<=VoF6c. This gives 

47rdiv(T)6 = V“ (^g’^'^FacFbd - g^^FecFfdQab'^ 

= g^Fbd'^^^Fac + g’^^Fac^^Fbd - \g^fg^Ffdgab^'^F,c 
- \g^^g^F,,gabV^Ffd 

= F{; div(F“)) + g'^FacV^Fu - 

= T'(-,div(f’»)) + F'^‘^VaFbc + ^F^^VbFce 

= F{;div{F^)). 



Corollary 5.2.1. Assume that the source-free Maxwell equations hold. 
Then 

div(TeO - 0. 

Remark 5.2.2. Recall that in the derivation of the Lorentzian struc- 
ture of spacetime we assumed that light rays can be described by null 
geodesics. Since light is electromagnetic radiation we should now check 
that this identification is consistent with the description of electromag- 
netism in this section. However, this would require a proper discussion of 
electromagnetism which is beyond the scope of this book. Readers with 
knowledge of electromagnetism may consult (De Felice and Clarke 1990, 
section 7.8) for the identification of light with lightlike geodesics. Here 
we can only say that null geodesics can be taken as a description of light 
rays in an (observer-dependent) limit. 



5.3 Einstein’s equation 

Recall that the equation which links geometry and matter should be of 
the form 

Vg = T. 

In the preceding two sections we have motivated that the right-hand side 
of this equation should be a symmetric, divergence- free ( 2 ) tensor field. 
Now we will find an expression for the left-hand side. 
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In the Newtonian theory of gravitation, gravity is described by the 
equations 

f =0=grad(0), (5.3.9) 

A4> = kg, (5.3.10) 

where (j) is the Newtonian potential for the gravitational field. Equation 
5.3.10 is a second order partial differential equation for the Newtonian 
potential and describes how it is related to the mass density 
varrho of the universe. 

Recall that we have replaced Equation 5.3.9 by the geodesic equation 
V ^7 = 0 which is equivalent to 

T = -A“77- 

It follows that the Christoffel symbols have a role similar to the 
gravitational force field 0. One obtains the Christoffel symbols from g 
via differentiation, just as one obtains the gravitational force field 0 from 
the Newtonian potential (f> through differentiation. This indicates that (f> 
corresponds to the metric g. Since the Newtonian potential is related to 
the matter distribution via a second order partial differential equation, 
we expect that ^ i-^ is likewise a second order operator. 

Postulate 5.3.1 (Gravitation is determined by a 2^‘^-ord. pde). 

In any given coordinate system, T>\ g T>g is a pointwise smooth 
function of gcd, daQcdi dad^gcd- 

Theorem 5.3.1. Let (M, g) be a Lorentzian manifold such that dScal ^ 
0^ and Vg be a ( 2 ) tensor field which satisfies Postulate 5,3.1 and 

diviVg) = 0 



(cf Postulate 5.1.2). 

If in addition, Vg is linear in dad^g then there exist constants 6 
M such that 

1 

Vg = jj,(Ric - ~Scal^) + Ag. 

Proof. By Corollary 4.3,1 and the linearity assumption Vg must be of 
the form 

Vg = ciRic + C 2 Sca\g + csg. 

Lemma 4.3.1 implies now 

0 = div(P^) = 4* C 2 ^ dScal. 



® The condition states that dScal is not the null-function, i.e. dScal does not 
vanish identically. 
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Hence the result follows by our assumption that there is an rr € ilf with 
dScal|aj ^0. I 



Remark 5.3,1, The assumption that Vg is linear in its highest derivatives 
is rather awkward. Lovelock (1972) has shown that in 4-dimensional (but 
not in higher dimensional (!)) Lorentzian manifolds this assumption is 
not needed. Unfortunately, his proof is far too involved to be reproduced 
here. 

Remark 5.3,2, Observe that we did not even need to assume symmetry 
of T>g, i.e. Postulate 5.1.1 is superfluous. However, the symmetry as- 
sumption was important to prove the conservation property Corollary 
5.1,1 which motivates the requirement div(T) = 0. 

In conclusion, our postulates imply that gravity is governed by Einstein’s 
equation as defined below. 

Definition 5.3*1. Einstein’s equation (with cosmological constant^ yl € 
E is given by 

Ric - ^Scal^ -\-Ag = SttT, (5.3.11) 

where T is the energy momentum tensor describing the matter distribu- 
tion. 

In the above form, Einstein’s equation is valid in geometrical units where 
the Gravitational constant and the velocity of light are set to 1 (cf. (Wald 
1984, appendix F) for explicit translation rules to other units). 

Remark 5,3.3. Einstein’s equation itself does not indicate any special 
value for A, 

In the past, astronomical observations seemed to imply that |.4| is 
very small, if not zero. It should also be noted that the Newtonian theory 
of gravitation arises as a limit for c oo (c: velocity of light) if and only 
if ^ = 0. This implies that A must be very small if non-zero (cf. (Hawking 
and Ellis 1973, p. 362), (Sandage 1968)), 

On the other hand, I have been told that to present day cosmological 
data point to a non-zero value for A, 



Some of the theorems which will be presented do require yl = 0, 



'Scalp = SttT, 



(5.3.12) 



[p. 255 i] 
-^9 

i p. 270 



and in much of the literature Einstein’s 
with equation 5.3.12. 



equation is used synonymously 



^ Our guide ends with Einstein’s equation. For what follows we will also use 
the material which has been skipped in order to get to Einstein’s equation 





5.3 Einstein’s equation 271 



5.3.1 The Lagrangian formulation of Einstein’s equation 

In this section an alternative way which leads to Einstein^s equation 
is sketched. This approach also aids in finding an appropriate energy 
momentum tensor. Unlike the rest of this book, this chapter rests on 
an underlying principle which is difficult to verify directly. 

This section can he omitted on first reading and is not required for 
any other part of this book. 

It appears that all fundamental dynamical equations in physics admit a 
Lagrangian formulation. According to this formulationj a physical system 
is described by a Lagrange function C: E where E is an appropriate 
generalisation of a vector bundle over spacetime M which contains the 
possible physical states of the system. 

Such a setup is motivated by classical mechanics. One can calculate 
the movement of a point-particle 7: [a, 6] ^ with mass m which 
is subject to a conservative^^ force field through the variation of an 
associated Lagrange function. Let £ : x ^ R be given by C{x, v) ~ 

T ^ Then a curve 7 satisfies the equation 



mj{t) = -gradVj^(i) 
(cf. Equation (1.2.7)) if and only if 



-\-rh{t))dt = 0 

Vary |^^0^[a,6] 



(5.3.13) 



for all smooth maps h: [a,b] 1—^ R^ with h{a) = h{b) ~ 0. In fact, we 
have 



|T=o ■/[«,&] 

= ^ ("i (^) (7(0 .^W)r3 

- dV"(/i(t))^dt 

quickly. Since we will discuss now physical applications which make use of all 
the mathematics which we have skipped, a continuation of this guide would 
lead to a lot of skipping forward and backward. On the other hand, the 
reader should have by now enough physical motivation in order to read the 
mathematical sections which we have skipped without getting bogged down. 
Still, the reader is advised to read on and to skip back only when needed. 
On these occasions skipped material should probably be read section-wise. 
Here “conservative” simply means that there is a function V: with 

F = — gradF, 
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= m /i(0)r3 



- / + gradVj^(t), dt. 

a 



the first summand vanishes for all choices of h with h{a) = h{b) = 0. 
Assume, there is a to ^ b) with mj{to) + gradVj^^(tQ) 0. Since the 
scalar product is non-degenerate there is an ho € W with 



(m^ito) + gradVj^(t„), /io)r3 5^ 0 

(say > 0). Let h: [a, 6] — be any smooth function with h{to) = ho- 
By continuity there is a neighbourhood C (a, 6) of to such 

that <^m7(t) +gradX^^(£),ft(t)^^^ > 0 for all t € (t_,t+). Finally, let 

ijj: [a, 6] — )■ R“*” U {0} be a smooth positive function with non-empty sup- 
port in (t_ , t-h). Setting h ~ 'iph the integrand + gradVj^(^), 

is non-negative and strictly positive in an open set. Hence the inte- 
gral must be positive in contradiction to Equation (5.3.13), This proves 
my{t) 4- gradVj,y(t) ~ 0 for all t. 

The existence of a Lagrangian formulation is widely seen as funda- 
mental for general (classical) physical systems. The physical state of an 
elementary particle^^ is described by the section ^ : M — > of an appro- 

priate vector bundle. Its governing equation should again be determined 
by the variation of an integral equation whose integrand is built from 
its derivative, and physical fields which interact with ip. To make this 
program work one first has to define the derivative of a section in an 
arbitrary vector bundle. It turns out that one can generalise our treat- 
ment of derivatives of vector fields (cf. Sect. 2.6) and define connections 
of general vector bundles (Kobayashi and Nomizu 1963). It is also possi- 
ble to define a notion of curvature for these generalised connections and 
— analogously to the case of general relativity — • one can interpret this 
curvature F in terms of physical fields which interact with the given ele- 
mentary particle. To obtain the complete system of equations one writes 
down a Lagrange function which depends on 0, its derivative, the cur- 
vature F, and perhaps other physical expressions. We denote all these 
physical inputs collectively hy 4>: M E and hy r (pr = (p + a. 
one-parameter family of sections such that $ has compact support. The 
equations which have to be satisfied by the physical system are then 
given by 




for all variations (pr of (p. This recipe is referred to as the variation of 
the Lagrange function C, The mathematical properties of equations of 

It is far beyond the scope of this book to explain what this actually is. Our 
only purpose is to vaguely set the following discussion into context. Readers 
who want to know more are referred to books on gauge theory. 
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this type are similar to the properties one encounters in the theory of 
minimal submanifolds (cf. Lemma 4.4.8 and the discussion following this 
lemma) . 

We will now discuss an example of this formulation, the physical 
system consisting of electrodynamics and gravity. To keep things simple 
we will assume that there are no other physical inputs. In particular, 
there are no electromagnetic sources, i.e., there no charges. Our Lagrange 
function will consist of two summands, 



£ = 



-'gray 



+ ^el- 



where £grav stands for the contribution from gravity and £ei fov the 
contribution from electrodynamics. 

Remark 5,34- If we had included electromagnetic sources we would also 
have to add at least two more terms: 

— A term £kin for the elementary particle which is analogous to the term 
m( 7 , 7 )jj ^3 in the mechanical example above, and 

- a term Cint which describes the interaction of the elementary particle 
with the electromagnetic field. 

The reader may already have suspected that the electro-magnetic field 
F is in fact the curvature of a generalised connection (Bleecker 1981) 
The connection 21 can be identified with a 1-form A which, however, is 
not invariantly defined. (This corresponds to regarding the Christoffel 
symbols as tensors.). With this 1-form A we have F == dA. The first of 
Maxwell’s equations, dF ~ 0, is then a trivial consequence. According to 
the program above we have to vary C with respect to the 1-form A. This 
will give the second of Maxwell’s equations, divF = 0. Gravity depends 
on two geometric quantities, the torsion-free connection V and the metric 
g. We will use the Palatini formalism^ i.e., we will independently vary L 
independently with respect to V and g. Observe that this independent 
variation makes sense only if we do not assume a priori that V is the 
Levi-Civita connection. It will turn out that the variation with respect to 
the connection will fix the Levi-Civita connection and that the variation 
with respect to the metric will give Einstein’s equation. 

The simplest non-trivial, invariant function Cel which can be defined 
is (modulo constant factors and modulo the addition of a constant term) 
given by 

£el(4 V,5) = ’ 

where (F, F) = g^-’^g'^'^FabFcd- 

Alternatively, one could assume that the connection is the Levi-Civita con- 
nection and only vary the metric. We have chosen the Palatini formalism 
because this is more akin to the treatment of other gauge theories. 
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Lemma 5.3.1, Let B € Ti{]\L) he a tensor field with compact support 
and consider the mriation 0 ,. -- (,4. , V.g) ~ (A -f tB, V,^). Then 

holds, where V is the Levi-CivUa connection of g and div is the diver- 
gence operator associated with V. 

Proof From dBi^d = - VdBb we get 

= -^[ {F,dB)fiM 

07T 

= -tj^ 9"V^iFac){VbBd - 

= I r"f‘^iFao)^bBdm 

Jm 

- 9‘'^9‘^ybFacBd)m 

= ^ (di^(F(-,B«)») - (^F“)(S))mm. 

The first summand vanishes because of the theorem of Gaufi and the 
fact that B has compact support. I 

Since the Cq\{A,V , g)p>M does not depend on V the variation with 

respect to V vanishes. 

Lemma 5.3.2, Let h e sym('7^°(M)) be a tensor field with compact 
support and consider the variation 4>r — (A,V,gr) — {A,V,g + rh). 
Then 

( j f {Tpm) ^ f {Tei)abhfi^ jJ^M 

holds, where Tqi is given by Equation (5.2.8) 

Proof. The equation 0 = ((5r)a6(5r)*^)|^=o ^«63'’‘'+5a6^(5r)*V=o 

implies 

Recall from the proof of Lemma 4.6.20 that the derivative of det(9?r)j 
where (pr is a 1-parameter family of matrices, is given by (det(i^T-))* = 
tr{(pip~^) detcp. This implies 
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(det(px))‘ = tr{h) det{gr), 

where in this formula tr is the metric trace of a covariant (2) tensor. 
Using these two formulas we calculate 

{^-M9rrH9rr''V- det(ff.)) A • • • A dx” 

= 2FacFbd9‘^'^h‘^W-det{g) 

+ ^ (F, F) V'-det(p)tr(h))dxi A - • • A dx" 



Remark 5,3.5, We have thus obtained the form of the electro-magnetic 
energy momentum tensor by variation of the simple Lagrange function 

-(167t)-1(F,F) 

with respect to the electro-magnetic potential A. For other matter fields 
analogous results hold. In this sense it can be said that variational tech- 
niques aid in finding the correct energy momentum tensor. 

For the gravitational term we set 

^grav(-^7 = Y^(RiC;a6^^^ ” 2ii), 

where Ric is the Ricci tensor with respect to the connection V. This is 
again the (modulo constant factors and summands) simplest invariant 
function which can be build from the metric g anc^the connection V. 
Recall that for any two torsion-free connections V, V the difference is a 
( 2 ) -tensor field K which is symmetric in its covariant indices. In index- 
notation, K is given by VyW^ = VyW^ -f for all vector 

fields V, W. We simply write V = V 4- iiT. 

Lemma 5»3,3. Let C e 7^^(M) he a tensor field which is symmetric 
in its covariant entries and which has compact support Setting (V^) — 
V -j- rC and 4>r = {A^ (Vr),y) we have 
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= "T^ + K’^dcg^'’ - 2K\>>)c\bm, 

where K is defined by V — V + K and V is the Levi-Civita connection 

of 9^ 

Proof, Since the Ricci tensor is the only quantity which involves (Vr) 
we obtain 

Let X e M and consider a normal coordinate system centered at x. Prom 

(Ric.),, = 9, {P^, + rC^,) - 5, J 

+ (^i) + rC^i,) + ^C'dc) “ (^ac + ^C^ac) i^db + '^^db) 

and the fact that the Christoffel symbols vanish at x we see without 
calculation that at x and for r ~ 0 the derivative of Ric^ is given by 

— (RiCT-)o6|^==0,x ~ ^c^ab\r=0,x ™ ^^^ac\r=0,x' 

Since this is a tensor equation which is independent of coordinates it 
must hold at all points of M. 

We will now re-express VcC"^5— VbC'ac respect to the Levi-Civita 

connection V and the tensor field K. FVom the definition of K we get 

- Kl^Cl 

and therefore 
^“''(VcC'^6 - VtC:,) = 

- VhCl, - Kiel, + Kiel, + Kiel) 

= f\^cei-v,ei) 

+j-'^ {Kiel - Kiel - Kiel + Kiel) 
= div(tr2,3C - tri,2C') 

+ {Klg-^ - 2g‘^^Kl + K^gf^Sl) ei, 



where tr^^^ denotes the (metric) trace over the ith and jth entry. Since the 
first summand is a divergence with respect to the Levi-Civita connection 
its integral vanishes by the theorem of GauB. I 
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Lemma 5,3.4. Let h € sym(^^(M)) he a tensor field with compact 
support and Qt ~ 9 '^h. For <fr ~ have 

— “ 20 ^ J ^Qa^h 

Proof We can split the integral into two parts which will be considered 
separately, 

= -rE f A.{ (RicabigrT'' - 2 A) \J- det(£?T)) dx^ A • • • A dx” 

iDTrJMdTV' /|r=0 

= ii /„ (<«')“'/- *‘(»'))i„„ 

^ . 

- 2^— \/- det(^r)ir=o)^^^ A • - A dx^. 

Exactly as in the proof of Lemma 5.3.2 we see that the second summand 
in the integral equals —Ag^^hab^ For the first summand we calculate 

(5,)“V-det(5r)) 

/ |r=0 

= ^ ((5r)“'’)|^=o V- det(5^) + (fi'r)“'’^V'-det(s-r)|^=o 

= -g‘"‘'9'’‘^hbdV- det(^) + ^g“'’g'^hcd\/- det(g) 
and therefore 

RiCad^ ((5r)“^\/-det(5r))|^^0 

= ^ - Ric“^ + ^RiCcd9'^‘^5“*’) habV~det{g). 




The following corollary is the main result of this subsection. 

Corollary 5.3.1. Let A be a 1-form^ V be a torsion- free connection, g 
he a Lorentzian metric and set F — dA. 

Einstein^ s equation and Maxwell^s equations for a source- free electro- 
magnetical field, 
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Ric- -Seal g-\-Ag = ^ (^g'^'^FacFbd ~ ^{F, F)gab^ , dF = 0, divF = 0 
are equivalent to 

^grav) ® ^r{l-^M)r = 0 

for all variations (j)r == (^r , {^r)i9r) = (A + rS, V + rC, ^ + r/i) ty/iere 
B,C^h are tensor fields with compact support. 

Proof We can consider the variations with respect to A, W^g separately. 
Since C is arbitrary Lemma 5.3.3 implies at each point x 

(K>>^dS^ + K'^dcg'^’’ - 2K\’>)c%b = 0 

for all tensors Cab which are symmetric in a and b. This is equivalent to 

/s:“d5c + + 2K'^dc9'^'’ - 2K^c^ - 2K\<^ = 0. (5.3.14) 

Taking the trace with respect to h and c we get 

0 = K°''^d + nK^^^d + 2K^d^ ~ ^K^^d^ - 2K^d‘" = {n- l)K'^^d- 

Taking now the trace with respect to a and b we get 



0 = 2nK‘^dc - ^K^cd = 2(n - 2)K\. 

These equations together with the symmetry of K^bc in b and c imply 
(for n > 2) that all traces of K vanish. Hence Equation (5.3.14) simplifies 
to and Kabc is a tensor with the properties 

Kabc ” ~P^bac’i P^abc — ^acb> 

We will now show that this tensor vanishes. Since it has the property that 
it is symmetric in two indices and anti-symmetric in two other indices, 
the expressions sym(jf^^) and alt(K'*') both vanish. This is equivalent to 



0 — ^abc + Kbca + Kcab + Facb + F^bac + F^ba 



and 



0 — Kabc + Kbca + Kcab ~ {Kacb + Kbac + Kcba)‘ 



These equations imply Kabc + Kbca + Kcab ~ 0 and therefore, using the 
symmetries of F^,JD = Kabc + Kbac + Kcab — Kcab> Hence K vanishes 
and we have V = V. 
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The equation dF — 0 follows trivially from the definition F = dA. 
The second part of MaxwelPs equations, divF = 0 follows immediately 
from Lemma 5.3.1 since B is arbitrary. 

Since h is arbitrary and we know that V is in fact the Levi-Civita 
connection the validity of Einstein’s equation follows immediately from 
Lemma 5.3.2 and Lemma 5.3.4. I 

The process leading to Einstein’s equation via Corollary 5.1.1 is referred 
to as varying the total Lagrangian C^\ + >Cgrav respect to the metric 
9 - 

We have chosen electromagnetism for our matter model in order to 
have a concrete example. To my knowledge all fundamental^^ matter 
models admit a Lagrangian formulation such that 

) [ ((^^matter ^grav)MM) ^ 4^r ~ b 

/ |r=0 Jm 

for all variations of the metric is equivalent to Einstein’s equation for 
the particular matter model. 

That the Lagrangian ansatz described in this section works is by no 
means trivial and I have no explanation for it. 



5.4 The Einstein equation as a system 
of partial differential equations 

Physicists are accustomed to the fact that (classical) physical systems de- 
pend on initial conditions and then evolve in a determined manner which 
is governed by second order hyperbolic differential equations. Since the 
energy momentum tensor T contains the metric, the Einstein Equation 
(5.3.11) cannot be simply solved for a given T. Instead, one has to con- 
vert the system, of Equations (5.3.11) into a system of partial differential 
equations for g and some matter quantities. 

The analogue in relativity would therefore be to fix an (n — 1)- 
dimensional Riemannian manifold {S^^g) which represents an initial 
instant of time. This manifold will be isometric to a spacelike hyper- 
surface in the solution. Since Einstein’s equations are a second order 
system we would need to prescribe a symmetric ( 2 ) tensor field k which 
specifies the normal derivative of the induced metric ^g or, equivalently, 
the second fundamental form of our hypersurface. We also need to fix 
functions or tensor fields which represent the initial matter distribution 
at 17, possibly also their normal derivatives. 

A perfect fluid is a macroscopic concept and the Lagrangian formulation 
does not work well in this case. See (De Felice and Clarke 1990, chapter 
6.5) for a discussion. 




280 



5. General relativity 



The character of this system of partial differential equations will cru- 
cially depend on the form of matter assumed. In particular, one can 
choose unphysical matter models which lead to spacetimes in which it is 
possible for information to travel faster than light (cf. Corollary 7.4.1). 
It is also possible to choose unphysical matter models which do not lead 
to a hyperbolic system of differential equations. 

Another problem lies in the fact that we have always the freedom 
to change coordinates. Hence the choice of coordinate system may also 
have an effect on the kind of system of partial differential equations we 
will end up with. 

Nevertheless, in most situations of interest, it is possible to obtain a 
well-posed system of equations. We will show this for the special case that 
T = 0 and A = 0. In order to avoid subtleties arising from the theory of 
partial differential equations we will assume that our initial data k are 
analytic and that E is an analytic manifold. (This restriction allows us to 
appeal to the relatively elementary theorem of Cauchy-Kowalewskaya.) 
We will also fix coordinates in which the equations are especially simple. 

In Chap. 6 we will study the more general case of a perfect fluid. 
However, we will impose strong symmetry assumptions in order to sim- 
plify the problem drastically (cf. Sect. 6.2) — the system of equations 
will be reduced to a system of ordinary differential equations. 

Chapter 7 contains an intermediate treatment. We will again consider 
a perfect fluid but use weaker symmetry assumptions which lead to a 
system of partial differential with two independent variables. This system 
of equations is substantially simpler than the general system depending 
on 4 variables. We will therefore be able to give a smooth (rather than 
an analytic) existence theorem (cf. Theorem 7.4.1). 

Since the analogous but considerably simpler discussion in Chap. 6 
already exhibits some of the key concepts of the initial value problem 
for Einstein’s equation, the reader may wish to skip the rest of this 
section on first reading. 

Let (M, g) be a Lorentz manifold C iW be a smooth, spacelike hyper- 
surface with normal^^ n. For each x e E consider the geodesic 7 a, with 
7rc(0) ~ There is a neighbourhood of E which is foliated by these 
geodesics. If this neighbourhood is chosen small enough it is also foliated 
by spacelike hypersurfaces of the form Et = {7x(t) '• a: € 17}. 

If one views M as being foliated by spacelike hypersurfaces Et {Eq = 
17) with induced metric ^^g then one can view the associated second 
fundamental form kt as the t-derivative of ^^^g- 

Lemma 5.4.1. Let E be a spacelike hypersurfaces of (M,g) and {i7t} 
be a foliation of a neighbourhood as constructed above. Let . . . , 
be a coordinate system of E centred at x e E. 



14 



Here we mean: p(n,n) = -l, 5 F(n,v) = 0 for all v G TE. 
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Then there is a neighbourhood U of x e M such that g = + 

T!i=i (* . • • • , ^ )da;Mx^' . 

Moreover^ the second fundamental form kt of Ut i'S given by kt = 




Proof, We can find a neighbourhood U of x such that for every point 
y eU there is exactly one point x e SDK and one geodesic through x 
which satisfies 7 ^( 0 ) = and intersects S exactly once without leaving 
U, This gives a chart (W,(p) defined by (f{y) — 
where y = 7a (^)- 

It follows from our construction that the induced metric on Et is 
given by ^^g = • • • ,x'^~^)dx^dx^ , where gij are suitable 

functions. At t = 0 we have for each x e E 

n-l 

= -dt^+ ^ 5-ij(0,x^(x),...,x”~^(x))dxMx^ 
since 7 a (0) — Ux E E ~ Eq. Prom 

= {^t,^d^A) = \^d^i == 0 

we get 7a (^) J- This implies the claim for the metric compo- 

nents. 

From Lemma 4,4.6 we get 

kt{da,i,d^j) = = {^dA‘,dxi) 

= dt»9ij - (d^i,ydAi) = dt»gij - h{d^i,d^i). 

The assertion kt = follows now from the symmetry of kt and 

from 



( ^^9) ij — dt i^cch ^x’} ) 



=0 



=0 



= dA'gij - ^'■gi^dA'Axi) - ^*g{dx^AdAi) = 9tgij. 



We denote the Levi-Civita connection induced on Et by and the 
Ricci tensor of (Et,^^g) by ^^Ric. 
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Lemma 5.4.2. Einstein’s equation with vanishing cosmological constant 
for vacuum is equivalent to the following system of equations, 

0 = -^'Scal + l(dt^*gijdt^^g^i - 
0 = -d,i(^^g^%^^gjk) + 

Proof. Einstein’s equation is given by Ric - = 0 which is equivalent 

to Ric = 0. The Gaufi equation (Proposition 4.4.1) and Ric = 0 im ply 

^'Ric(t/, W) = {R{du U)W, dt) - ti{kt)kt{U, W) 

+ ^*9{kt{U,-)Kkt{W,-)^) 

^ {R(U,dt)dt,W) 

- \{dt^%A^'9kl - dA9ikdA9jAg^‘Y^g>^^U^W^. 

In order to simplify the term {R(dt, U)W, dt) we may assume that U, W 
can be extended to vector fields U, W which are everywhere tangent to 
St and commute with dt. Using Lemma 4.4.4 we obtain 

{R{u,dt)dt,w) = (vu'^t-SdAu^uw'^ 

= - {^dAu9u W) = - w) 

= -(VQkt{U,.f,w) 

= -\dt • {£q^^<9{U,W)) + ^-£q^^^9{U,Sq^W) 

= -\£d/dt'''9{U, lU) + \£q^^' 9{U. £dt''^9{W, •)“). 

= ( - \dtdA9ij + \dA9ikdA9ji9’^^)U^Wi . 

Hence the spatial components Ric^j = 0 of Einstein’s equation are equiv- 
alent to the first system of equations in Lemma 5.4.2. 

Prom 

= Rijkiig^ 4- ® dtY^{g^ -\-di0 dtY^ 

- Seal 4 2Ric(at,5t) = 2(Ric » ^g){dt,dt) 
and the Gaufi equation we get 

Ric(0t,5t) = -l(Scal + ^‘Scal + tr(fet)2 - \\kt\\^). 
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Hence Ric = 0 implies ^'Scal + \ ~~ — 0- 

Let {Ely . . . , En~i} be an orthonormal frame of St and be a vector 
field which is tangent to Et> Then the Codazzi equation (Proposition 
4.4.2) implies 



n— 1 

Ric{dt, U) = Y^{R{Ei,dt)U,Ei) 

i=l 

n-1 

= 'E(('^u(^t®dt){Ei,Ei),dt) 
i=l 

-(VE,(kt®dt)(U,Ei),dt)) 

= E ( dt) - ((V E, • h){U, Ei)du dt) ) 

i=l 

n-1 

i=l 

= -U • ti{kt) + ^'div{kt){U). 

= i( - 

Hence Ricjj = 0(i€{l,...,n-l})is equivalent to the last system of 
equation in the statement of the lemma. I 

The first system of differential equations in Lemma 5.4.2 consists of 
|n(n“ 1) coupled differential equations for the |n(n- 1) unknown func- 
tions Qij ^ Qji {ije , n - 1}). One would expect that these equa- 

tions would uniquely determine g and that therefore Einstein’s equation 
would be over-determined. Since over- determined systems of differential 
equations have only very few solutions (if any at all!) and are in general 
incompatible with initial value problems, Einstein’s equation seems (at 
first sight) to be very different from other equations in physics. However, 
the following lemma shows that the over-determinacy of this system is of 
a very special nature and in fact compatible with a (slightly restricted) 
initial value problem. 

Lemma 5.4.3 and Theorem 5.4.2 below hold for initial data which are 
not necessarily analytic. However, the proof is then much more difficult 
since we cannot anymore appeal to the relatively elementary theorem of 
Cauchy-Kowalewskaya. 

Theorem 5.4.1 (Cauchy-Kowalewskaya). 

Let F: and /q: R"^“^ ^ R^ be analytic maps. Then there 
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is a neighbourhood U C of x'^ = 0 and a unique analytic map 
/ : ZY — > K* which satisfies the system of partial differential equations 

d,.nf = F(x\. ..,X"^,d,lf,..., ^,.n-^f). 

and the initial conditions /(O, = /o(a;^, . . . 

Proof (sketch). The idea of proof is to determine the Taylor series of / 
at a; € {x'^ = 0} by successive differentiation of the system of partial 
differential equations and then to show that this series converges. A 
formal proof can be found in (Dieudonne 1971). I 

The theorem of Cauchy-Kowalewskaya rests on the fact that an analytic 
function is determined by its Taylor series and it does not hold when 
the word ''analytic” is replaced by “smooth” . In the non-analytic case 
the structure of the system of partial differential equations matters for 
both, existence and uniqueness of solutions. This fact indicates that by 
restricting to the analytic case one may (in general) obtain results which 
are misleading because they do not generalise to the smooth case. 

Lemma 5.4.3, Let {M^g) be a real-analytic spacetime, be a 

foliation as in Lemma 5,4 A, and assume that the spatial metric compo- 
nents Qij (i,j G {1, . . . ,n — l}j satisfy 

dtdt^‘9ij = -2^'Ric<j - (^dtPijdtgki - dtgikdt9jig^)g'‘^ . 

If at U = Uq the “constraint equations ” 

0 = ^Scal -h (tr(/c))^ - ||/c||^, 0 = -dtr(fc) -f ^div{k) 

hold then {M,g) satisfies Einstein^s vacuum equations with vanishing 
cosmological constant, Ric = 0, 

Proof Let Ric be the Ricci tensor associated with g ~ -d^^ + gijdx'^dx^. 
By assumption, this tensor satisfies Ric^^ == 0 for all spatial compo- 
nents. The identity div(Ric- -Scal^) == 0 implies therefore g"^^(daRicbc- 
2P^c^icdb) — ^cScal = 0 which is equivalent to 

n-l 

0 = -dtRictc - ^jRicic - ^cRictt - 

i=l 

Since Ric^j = 0 this is a linear system of n partial differential equa- 
tions for the unknown functions Ric^t, RiCi^. The constraint equations 
are equivalent to Rictt = Rict^; = 0 at Hence we have Rici^ = 
Rict^ = 0 everywhere by the uniqueness-part of the theorem of Cauchy- 
Kowalewskaya. I 
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Theorem 5.4.2. Let be an {n — 1)- dimensional real- analytic 

Riemannian manifold and k G sym{T^{S)) be a real-analytic tensor 
field which satisfies 

0 ” ^Scal + (tr(fc))^ “ \\k\\^, 0 = ”dtr(fc) + '^div(fc). 

Then there is an n-dimensional real-analytic Lorentzian manifold (M, g) 
and an immersion i\ E M such that ~ ^g and = k, where 
ko is the second fundamental form of t{E), 

Moreover, if (M, g) is a second Lorentz manifold with these properties 
then i{E) C M and Z{E) C M have neighbourhoods which are isometric. 

Proof. Fix a coordinate system for E and consider the system of partial 
differential equations 



dt{h)ij = -[^"^Cij]{^*g,{{ht){k])k=i,...,n-i, i(ht)[kl])k,i=i n-i) 

- (ikt)ij(kt)ki - 

dt{ht)[k]ij = 2d^k{kt)ij, 

9tikt)\kl\ij = 2dxkd^i{ki)ij, 

where [^^RiCij]{^*g, {{ht)[k])k=i„..,n-i, {{kt)[kl])k,i=u...,n-i) is the alge- 
braic expression defined by 

[^‘Ricy](^*5', {d^h^^g)k=i,...,n-i, {d^i‘d^i^*^g)k,i=i,...,n-i) = ^*RiCij. 

The theorem of Cauchy-Kowalewskaya implies that for any real- analytic 
set of initial values fco, ((/io)[fc])A:=i,...,n-i. ((fio)[fc^])M=i> -,n-i}. 

there is a neighbourhood U oft = 0 and a unique solution of the system 
of partial differential equations which is defined on U and has these ini- 
tial values. Since dt^^gij - 2{kt)ij the equation dt{ht)[k]ij = 2dxk{kt)ij 
implies 

0 = dt{ht)[k]ij — d^kdt *gij = ^t{{f^t)[k]ij ~~ d^k ^gij)> 

From an integration of this equation we see that (ht)[k]ij — d^k^^gij if 
(ho) [k]ij — d^k . In the same way we see that {ht)[kl]ij = d^k d^i ^*gij 
if {hQ)[kl]ij = d^kd^i^'^gij. It follows that the solutions ^^gij of this 
system of equations also solves 

dtdt^'gij = -2-^'Ricy - {^dt^%jdt^>^gki - 
if and only if the initial conditions 

= 2(fco)y, {ho)[k]ij = d^k^'^gij, {ho){kl]ij = d^kd^i^^gij 
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hold. Now the assertion follows directly from Lemma 5.4.3. I 

Theorem 9.4.1 is local in character. Also note that the coordinates chosen 
tend to develop singularities due to focusing effects (cf. Proposition 4.6.1) 
and observe that the geodesics t (t, . . . , o:^) are length maximising. 

A discussion of the smooth case can be found in (Hawking and Ellis 
1973, chapter 7), An improved but mathematically more sophisticated 
theorem is presented in (Hughes, Kato, and Marsden 1977) 




6. Robertson- Walker cosmology 



6.1 Homogeneity and isotropy 

It is very difficult, and one cannot make with any certainty assertions 
about the universe as a whole. This is so because we only know a very 
small portion of the universe. Hence any cosmological model reflects our 
own prejudice. Nevertheless, there are certain assumptions which seem to 
have a high degree of plausibility. After having built a cosmological model 
one can compare it with the few data we do have. Although imperfect, 
this approach seems to have given us much deeper understanding of the 
development of the universe than would seem possible at first sight. 

The first cosmologists placed the earth at the centre of the universe. 
Copernicus’ revolutionary model gave us a much more humble place in 
the solar system — the earth was reduced to being just one of its planets. 
This model of the universe had such a success that nowadays we not 
only take it for granted but don’t even sincerely doubt that there may 
be other (more advanced) forms of life in the universe. At Copernicus’ 
times, such a thought would have been considered blasphemous. The 
monk Giordano Bruno (1548-1600) was burned because he asserted the 
truth of such ideas. ^ Our new modesty leads us to think that our place 
in spacetime is in no way exceptional, and that there are no exceptional 
places anywhere in spacetime. We will use this fundamental idea to build 
a cosmology. 

Let a; € M be our event in spacetime (M,^) and Ux € TxM be the 
velocity vector of our world line. If there is not any point (or direction) 
in spacetime which is special then the universe should be isotropic, i.e., 
it should not be possible to distinguish any direction in by physical 
measurements. Although a glance at the nocturnal sky indicates that 
this is at odds with experience, on a sufficiently big scale this assumption 
coincides very well with observation. The galaxies seem to be randomly 
distributed, and since at night we mainly see a part of a single galaxy 
(the Milky Way), our first impression is not very representative. 



He is sometimes styled and an important forerunner of enlightenment. I 
must confess that I find his book (Bruno 1584) quite unscientific. 



M. Kriele: LNPm 59, pp. 287 - 305, 1999 
© Springer- Verlag Berlin Heidelberg 1999 
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The mathematical interpretation of the isotropy assumption is that 
it is impossible to construct geometrical objects, using and which 
are breaking this symmetry. Let E be (any) 3-dimensional subspace of 
the (n - l)-dimensional space and u^v^w be vectors in E. Since 
R{v, U)U isa, vector, isotropy about Ux and the fact that (i?(u, U)U, U) = 
0 imply that R{v^U)U = fj,{x)v for some scalar Consider any 2- 
plane P = span{u, u} in E. The sectional curvature K(P) should be in- 
dependent of P since otherwise there would be a plane Pq of maximal sec- 
tional curvature which in turn defines a distinguished direction in E. 
Since by isotropy there should not be any distinguished direction in E we 
conclude that at x the equation R{u,v)w ~ k{x) {{v^w) u ~ {u^w) v) + 
c(u, u, w)U holds. We show now that c = 0. The vector 

RiP) = ({u,vf-\\u\\-\\v\\r^/^R(u,v)U, 

lies in and depends only on P (rather than on the representatives 
u^v). Denote by irp the orthogonal projection U-^ — > P. For each vector 
n G P with g(n^n) = 1 let P„ be the plane in E orthogonal to n. Since 
7Tp„ (P(Pn)) lies in Pn and is therefore orthogonal to n we obtain a vector 
field 2J: n 7rp„(P(Pn)) on the 2-sphere {n G P : g{n,n) — 1}, Since 
the 2-sphere is compact the vector field has constant length since 
otherwise there would be a distinguished direction of maximal length 
in violation of isotropy. By Theorem 2.5.11 this is impossible unless the 
vector field vanishes identically. 

By isotropy, it should have no component in P since otherwise there 
would be a distinguished direction in P. It follows that R{P) is orthog- 
onal to P. Further, its length cannot depend on P since otherwise there 
would be a distinguished plane (and therefore a distinguished vector) in 
P. 

Let u^v^w £ E. From 7rspan{ui,^i2}(-^^P^^{^ii "^s}) = 0 for all vectors 
ui^U 2 £ E we obtain 



=0 

{R{u,v)Ux^w) = {R{u,v -\-w)Ux,w)) - {R{u,w)Ux,w)) 

=0 

/ s 

= {R{u,v + w)Ux,v + w)) - {R{UjV + w)Ux^v)) 

=0 

r -s 

= - {R{u,v)Ux,v)) - {R{u^w)Ux,v )) . 

This implies that the tensor field (u^v^w) h-j- {R{u^v)Ux^w) is antisym- 
metric. Since {R{u^v)Ux^w) — ~ (R{u,v)w^Ux) = c(u,u,tz;), c is a 3- 
form. The first Bianchi identity (Lemma 2.8.2) yields {R{u^v)wyUx) + 
(R{v^w)u^Ux) + {R{w^u)u^Ux) = 3c(u, u,ry) — 0. This motivates the 
following definition. 
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Definition 6.1.1, Let {M^g) be a Lorentzian manifold and € T^M, 
{Ux^Ux) = — 1. The spacetime {M,g) is called infinitesimally isotropic 
about Ux if at x the curvature tensor satisfies 

R{uj v)w — k{x) {{v, w)u~ {u, w) v) 

R{v,Ux)Ux = 

for all u,v,w G where fa{x)jK{x) € R are independent of u,v,w. 
The spacetime (M, g) is called infinitesimally isotropic if there exits a 
normalised) timelike vector field U such that (M, g) is isotropic about 
Ux for all X £ M. If (M^g) is infinitesimally isotropiCf U is called a 
cosmological observer field. 

Given an infinitesimally spacetime, there may not be a unique cosmolog- 
ical observer field. For instance, in Minkowski spacetime all normalised 
timelike vector fields are cosmological observer fields. 

In the following we will assume that spacetime is infinitesimally 
isotropic. While it can be argued that infinitesimal isotropy about our 
own velocity vector is backed experimentally fairly well, it is a very 
questionable extrapolation to assume that spacetime is infinitesimally 
isotropic. On the other hand, this extrapolation seems to be exactly the 
lesson learned from Copernicus. Hence to demand that (M, g) is infinites- 
imally isotropic appears very plausible to us. (Our acceptance of such 
a postulate is in striking opposition to the response a medieval scholar 
would have given). 

Lemma 6.1.1. Let (M^g) be infinitesimally isotropic about Ux- 
Then 



R{% v)Ux “ 0 and R{Ux, u)v = (w, v) Ux 

for all UyV ^ U^ and the energy momentum tensor is given by 
Tx = {e(x) + p{x))Ul P{x) 9x, 



where 



S'Ke{x) — “(n ~ 2)(n - l)/^(a:) ~ A, 

%7^p{x) = (n - 2) ^-“(n - 3)k{x) -h p>{x)^ + A, 

Proof Let u^v^w G U^. The first assertion follows from v)Ux,w) = 
— {R{U)V)w,Ux) = 0 and {R{U)V)Ux^Ux) = 0, The second asser- 
tion is a consequence of {R{Ux)U)V)W] = {R{V)W)Ux^u) = 0 and 
(R{Ux,u)V)Ux) — {R{U)Ux)Ux,v) = ja{x) (u^v). Taking the trace of R 
we obtain 
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Ric{U^, = (n - l)fJ.{x), Ric(C/a;,t;) = 0, 
Ric('i^, v) = ((n — 2)k{x) — fi{x)) {u^ v ) , 



and therefore 

Ric ~ ((n - 2)jj> + (n - 2)k) + ((n - 2)k - fi) g. 

Taking again the trace we have Seal = -(n - 2){fi + /^) + 
n ((n “ 2)k ~~ ij,) — (— 2n + 2)/uH- (n — l)(n - 2)k. The energy momentum 
tensor is now given by 

87tT = Ric — -Scal^ + Ag 

4- ^(n -2)k~ fj,-~ ((-2n + 2)/i + (n - l)(n - 2)/^) g 

= (n - 2){fi + k)U'^ -h ^(n - 2)^-^k -j- (n - 2)/^ 4- g. 

It follows that the energy density c and the pressure p are given by 
87r(e +p) = (n - 2)(p 4- k) and 87rp = \{n - 2)(3 - n)K + (n - 2)p H- A, 



Lemma 6.1»2. Let (M^g) be infinitesimally isotropic and U be the cos- 
mological observer field. Further assume that n > 3 and that e + p ^ 0, 
Then U-^ is an integrable distribution. The hypersurfaces perpendicular 
to U are totally umbilic (cf Definition 4>4>V <^'^d n is constant on these 
hypersurfaces. 

Proof. Let T, Z be vector fields which lie in U-^ at x and satisfy the 
equation 

VijX - (X, VijU) U = 0 

(analogously for Y, Z). This can easily be arranged by considering vec- 
tor fields along a hypersurface X which lie in They may then be 
uniquely extended into a neighbourhood of X using the above differ- 
ential equation. From Xjj{X,U) = {VjjX ~ {X,XjjU)U,U) = 0 it 
follows that Z are everywhere perpendicular to U. Moreover, the 
derivative XjjX is parallel to U since VjjX = — (yjjX^U)!/ (anal- 
ogously for Y, Z). We will now exploit the second Bianchi identify (cf. 
Lemma 2,8.1), 

{VuR) (X,Y) + (VxR) {Y, U) + (Vyi?) {U,X) = 0. 

Using the formulas in Definition 6.1.1 and the property 
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VuX,VuY,VuZ \\U 



we calculate 

(V[/i?) {X, Y)Z = Vu {R{X, Y)Z) - R{X, Y)VijZ 

- R{XijX, Y)Z - R{X, VijY)Z 
= dK{U){{Y,Z)X-{X,Z)Y) 

+ k{{Y,Z)VuX - {X,Z)VuY) 

-0 + 1:1 (Y,Z) XxjX - M {X, Z) XuY, 

(XxR) (Y, U)Z = Xx (RiY, U)Z) - R{t:u- {X xY) , U)Z 

- R(Y, XxU)Z - R{Y, U)XxZ 
= V;^ (/X (F, Z)U)-iJL {Z, XxY) U) 

-n{{XxU,Z)Y-{Y,Z)XxU) 

- m (- < 17 , XxZ))Y - (1 {Y, XxZ) u 
= d^i{X) {Y,Z)U 

+ (k + m) ((F, Z) XxU - (F, XxU) Y ) , 

and 

(VyjR) {U, X)Z = - {XyR) {X, U)Z 
= -dfi{Y) {X,Z)U 

- (k + m) {{X, Z) XyU - {Z, XyU) X) . 

Inserting these equations into the second Bianchi identity we obtain 

0 = d/i(X) (F, Z)U - d/r(F) (X, Z) U 

+ {,i + k) (( F , Z) XuX - {X, Z) XjjY) , ( 6 . 1 . 1 ) 

0 = dK(U) ((F, Z)X- (X, Z) Y)-(m + k)((Y, Z) XxU 

- {X, Z) XyU + {XyU, Z)X- {XxU, Z) Y) . ( 6 . 1 . 2 ) 

Since n > 3 there are pointwise linearly independent vector fields X, Y 
with X 1. Z,Y 1. Z. For these vector fields Equation 6.1.2 implies 

0 = {ii + k) {{XyU, Z)X- {XxU, Z) F) . 

It follows immediately that for orthogonal vector u^v the expression 
(u^VyU) vanishes. Hence, restricted to the bilinear form VC/^ is 
a multiple of g restricted to U^, In particular, VU'^ restricted to 
is symmetric. We will now show that the tensor field V((ac -h jJi)U) is 
symmetric in all of TxM, Since U is symmetric on U'^ and V{{n-\-g)U) = 
(d« + d^) ® [/ + (k: + jY)VU we only have to show -\- fj,)U)jU) = 
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{Vij{{k + n)U),X). Setting Y = Z and choosing X 1 Z X we get 
from Equation 6.1.1 

A^l{X) = (Vf/X, U) = -{^l + k) {X, V^U) . 

Lemma 5.2.1 implies 



dp(X)--(6 + p) 

and therefore ~^{n - 3)dK{X) + d/x(X) = -{fi + k) {X,VjjU), where 
we have used the formulas provided by Lemma 6.1.1. Combining this 
equation with dfx{x) = -(/i + k) (X, XjjU) we obtain dK{X) = 0. Now 
we can calculate. 

(Vx((« + u) = -d(K + At)(x) = -dM(X) = (// + «) (X, VifU) , 

(Vf;((« + ti)U), X) = (/X -k k) {VuU, X) . 

It follows that V({k-\-/j)U) is symmetric in all of T^M and that therefore 
d(^{K> — 0. By the lemma of Poincare there is a function t with 

dt — (j>)U^ , The hypersurface = {rr € M : t{x) = f} is orthogonal 

to U for each i. Its second fundamental form, k{u^v) ™ (u, is a 

multiple of g restricted to which implies that is totally umbilic. 



Theorem 6.1.1. Let {M,g) be infinitesimally isotropic and U be the 
cosmological observer field. Assume that e A p ^ 0. Then there is an 
(n-l)- dimensional Riemannian manifold (M^gs) of constant curvature 
£ e {1, 0, —1} such that (locally) M — Rx M, g ~ —dt‘^ + where 

e/a^^K-l{dK{U)/{KAp))^, 

Proof, It is clear that ^ 4- if)~‘^dt^ + gt for some ^-dependent Rie- 

mannian metric gt- Let u, u 6 U-^ be vectors with u J_ u, (u^u) — 
(u,u) = 1. Then Equation (6.1.2) gives with v ~ X,,u — Y — Z 
dniU) = u) + {XyU, u)). Hence the second fundamental 

form of the hypersurfaces perpendicular to U is given by 

, / V 7 rr\ ldK{U) 1^ 

The GauB equation gives 

Rst ('^5 w)k{v^ -)^ ~ fc(u, w)k{u, -)^ 
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By the Lemma of Schur (Proposition 4.3.4) the factor n — ^ 

constant in the hyper surfaces. In other words, the hyper surfaces St have 
constant curvature. We can therefore write ^ 

where a: M R’*' is a function and the (n — l)-dimensional metric of 
constant curvature e e {— 1,0,1}. 

We will now show that the factor 1 /(k -h /i) can be absorbed into 
dt. To this end we have to show that g only depends on t (recall from 
the proof of Lemma 6.1.2 that dn{X) — 0 for each X e T^St and that 
therefore /c depends only on t). Since k is constant in the hypersurfaces 
t — const, so is dtK. This implies that 



, dtK dtK 2 . 

^ — ^ g — q 

2 2 



depends only on t. On the other hand, a direct calculation gives for 
v^w & 



k{v, w) = {v, VyP) = (« + /i) {v, ) = {k + n)gabv‘T^tw'' 

= (« + 1 “)^ {9c9at + dtQca ~ daQct) + li)dt9caV°‘w'' 

= (/i + n)adtags. 



Comparing the two formulas for k we obtain that 



f^=- 




depends only on t. Hence (K{t)+g,{t)) ^dt is the differential of a function 
which we take as our new time coordinate. I 



Corollary 6.1.1. If {M,g) is infinitesimally isotropic then it is also 
spatially homogeneous ^ i e. for hypersurfaces St orthogonal to U and all 
x^y e St there exists for any pair of orthogonal frames ofT^Stf TySt 
an isometry M —» M which maps one of the frames into the other. 

Proof This follows from Corollary 4.5.1 and Lemma 4.5.5 since {Sta'^{t)gs 
is a Riemannian manifold of constant curvature. I 



Corollary 6.1.2. Let (M^g) be infinitesimally isotropic and U be the 
cosmological observer field. Then there is an interval I — (^_,^+) and 
a spaceform (S^g^) of constant curvature e € { — 1,0,1} such that the 
universal cover of (M^g) is isometric to (J x 17, ~dt‘^ + ge) and U — dt. 

If n — A then there are local coordinates (t, r, such that 

g — -d^^ + a^{t) + r^ (d0^ + sin^(6*)d(^^)^ . (6.1.3) 
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Proof, The first part of the corollary is obvious since I x S is simply con- 
nected for any spaceform U. The second part follows immediately from 
the classification of 3-dimensional Riemannian manifolds with constant 
curvature (cf. Lemma 4.5,5). I 

A spacetime (M^g) which is locally isometric to a 4-dimensional in- 
finitesimally isotropic Lorentzian manifold is called a Robertson- Walker 
spacetime or Robertson- Walker cosmology. The metric given by Equation 
(6,1.3) is called the Roberts on- Walker metric. 



6.2 The initial value problem 

for infinitesimally isotropic spacetimes 



In this section we solve Einstein’s equations Ric-^Scalp + A^ = SttT for 
Robertson- Walker cosmologies. While, in general, Einstein’s equations 
give rise to a system of partial differential equations, in the case at hand 
we have already shown that the unknown functions depend on only one 
variable. We will therefore obtain an ordinary instead of a partial system 
of differential equation. This simplifies the problem greatly. However, 
even this simple case exhibits typical aspects of Einstein’s equation. 

We denote the derivative with respect to t with (•)^ 

Lemma 6.2*1. Let (M^g) be a Robertson- Walker spacetime and u,v,w 
be tangent to the hypersurfaces St which are orthogonal to U. The cur- 
vature expressions are given by 



R{u, v)w = ( — — + 1 ((u, w)u- {u, w) u), R{v, w)U = 0, 

\ a a j 

R{v,V)U = -—v, R{v,U)w = -—{v,w)U, 
a a 

Ric(C/, U) = -(n - 1)^, Ric{U,v) = 0, 



Ric(u,u;) = f h (n - 2) 



(a^)^ ^ \ \ / 

—r + -2)]{vM 



Scal = (n-l)(2^+(n-2)(<^ + i)) 



Proof The constant curvature metric 



gu = (d^^ 4- sin^(^)di^^) 



has the curvature tensor Ri;{u^ v)w = s{{v^ w)u- (u, w) v) (Proposition 
4.3.3). Hence the formula for R follows from Lemma 4.4.14. The other 
formulas are direct consequences from Lemma 4.4.15. I 
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Corollary 6.2.1. If the energy momentum tensor is given by 
T = {€ + p)U'’®U^ +pg, 



then EinsteMs equation is equivalent to 

8« + ^ = i(n-l)(n-2) (^ + j) 



and 



^ , / a'' n - 3 

S'Kp-A = -(n - 2) 1^— + 




(6.2.4) 



(6.2.5) 



Proof, Recall from Lemma 6.2.1 that the Ricci tensor Ric restricted to 
the spatial subspace is a multiple of the metric g restricted to this 
subspace. It follows that Einstein’s equation Ric — ^Scolg Ag === SttT 
restricted to this subspace. 




n — 1 
2 



(2^ + (n-2) 





+ ^)g\s 



is equivalent to Equation (6.2.5). Prom Lemma 6.2.1 we get Ric(u, U) ~ 
0 for all vectors v L U which implies that the only other non-trivial 
component of Einstein’s equation is given by evaluating it on the pair of 
vectors {Ux^Ux)> We obtain 

+ 2^ (2^ + („_2) ((^ + ^)) 

which is equivalent to Equation (6.2.4). I 



Observe that Einstein’s equation is not a well posed system of differen- 
tial equations. Instead, we have only two equations for three unknowns, 
a, e,p. Moreover, only derivatives of the function a appear in our system 
of equations. The first problem has a direct physical resolution. Just 
specifying a perfect fluid is not enough to specify a matter-model com- 
pletely. Rather, perfect fluids give a framework which is fitting for many 
different matter models. In particular, vacuum is a (very degenerate) 
perfect fluid, and so is dust. In order to arrive at a determined system of 
equations we therefore have to specify an additional relation between the 
energy density e, the pressure p, and the metric described by a. In the 
following we make a rather simplistic assumption, namely that there is 
a given equation of state: p = /(e) for some smooth function / : M — ^ E. 
Having made this assumption, we still have the problem that we have 
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two differential equations for a rather than a system of two differential 
equation for a and e. We can resolve this problem by replacing one of 
our equations with an equation of motion (Lemma 5.2.1). 

Corollary 6.2.2. Assume that there is a smooth function / : R -->■ R 
with /(e) > — e for all e € R. Let ao e R"*" \ {0} and cq eR and assume 
ihat + yl) “ e > 0. Then there exists a unique solution 

(a, e) of Einstein’s equations sucha(0) = ao and e(0) = eo- The functions 
a, e satisfy 

(i) (n - 2)^ = -Stt ^/(e) + ^e) + 

(ii) (n-l)^ = -^. 

and 

Proof Assume first that a, e are a solution of Einstein’s equation. Equa- 
tion (i) is a linear combination of Equations (6.2.4) and (6.2.5). Equation 
(ii) follows from Lemma 5.2.1. Finally, the equation for a'(0) follows im- 
mediately from the equation for e in Corollary 6.2.1. 

For the converse notice first that for our initial conditions there is a 
unique solution a, e which satisfies the system of equations (i), (ii) and 

a'(0) = (87reo + A) - We have to show that this solution 

is also a solution to the system of equations given in Corollary 6.2.1. It 
is clear that this system of equations is satisfied at t = 0. We will show 
that the first equation is satisfied for all t. Since the second equation is 
a linear combination of the first equation and equation (i), it must then 
also be satisfied for all t. Defining 

(j):=Sn€ + A- i(n - l)(n - 2) (1^ + ^) 

we have to show that (j) vanishes for all t. Taking the derivative of <j> and 
using equations (i), (ii) gives 



4 >' = STre' - “(n — l)(n - 2) 



2a'al'c? — 2aa' ((a')^ + 



a' fa" {a'r + e 



<i) -s.(„ _ I)l(e + p) - (N ^ 1)(« - 2)^ ^ 

(^) — ^ — 87r(n - l)c H- (n - l)(n - 2)^ + 87r(n - 3 )e + 2 A 

a \ a 
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Since (j)(0) = 0 and 0 is a solution of the differential equation <p' = 2^ (j>^ 
the fundamental theorem for ordinary differential equations implies that 
(j) must vanish for all t. I 

This solution of Einstein’s equation is typical in two aspects. Firstly, it 
is often advantageous to exchange part of the original set of equations 
for the equations of motion, div(T) = 0. Secondly, Einstein’s equations 
are not a free system of differential equations but are constrained. Recall 
that we were not free to choose a'(0) even though we had a second order 
equation for a. In other words, only a restricted set of initial values 
had the chance to lead to solutions of Einstein’s equation. The system 
of differential equations which was solved was derived from Einstein’s 
equation but not identical to it. We had therefore to show that the 
solutions to this system are also solutions to Einstein’s equation. We 
did so by deriving an additional linear differential equation and used 
our constraint (i.e., the choice of a'{0)) to show that the solution of 
this equation implies that the original set of equations is satisfied. This 
phenomenon has a direct counterpart in more general settings where we 
have to deal with systems of differential equations. 



6.3 Geodesics and redshift 

In 1929 Hubble made a cosmological discovery which implies that distant 
galaxies are moving away from us (and each other) at a rate proportional 
to their distance. This astronomical fact shattered the long cherished idea 
that our universe was an eternal arena in which the physical processes 
take place.^ It is instructive to describe Hubble’s discovery in slightly 
more detail: Each star has a spectrum of light which contains char- 
acteristic gaps due to absorption of light of certain frequencies in the 
atmosphere of the star. Since we have physical explanations for these 
absorptions, we can calibrate these patterns and thereby obtain infor- 
mation about the chemical composition of the star’s atmosphere. Hubble 
discovered that for stars in galaxies which are not too close^ these gaps 
are shifted towards smaller frequencies. Moreover, this shift is propor- 
tional to the distance of the galaxy. Prom his observation it was then 

^ Einstein introduced his cosmological constant a decade earlier because he 
wanted to have static solutions in accordance with the prejudice of his time. 
Had he not done so, there would have been another striking prediction by 
general relativity, 

^ For very nearby galaxies the (local) movement of the galaxy relative to us 
overshadows this effect. 
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concluded that all galaxies are moving away from each other. (Everyone 
is familiar with an analogous effect; If a fast car is approaching one has 
the impression that the noise of the engine is higher pitched than when 
it is moving away: In other words, if the source and the detector of a 
sound move away from each other, the frequency of the sound appears 
to be smaller). 

In this section we will show that Hubble’s discovery can be under- 
stood within the framework of Robertson- Walker cosmology (cf. Corol- 
lary 6.3.2 below). This is one of the great successes of general relativity 
and the isotropy assumption. Recall from Sect. 1.4.3 that we can describe 
the world lines of photons by null geodesics. The energy of a photon 7 
measured by an observer u is given by E = hu ^ - {u, 7). Here z/ denotes 
the frequency of the photon (as measured by u) and h denotes Planck’s 
constant. In Robertson- Walker spacetime we have a natural unit vector 
field U which is approximately tangent to the world lines of the galaxies. 
We will therefore define the energy of a photon using this distinguished 
observer. In this section we will always refer to this energy. 

Let 7 be a photon which moves from x e M to y E M, In general, 
it is possible that its energy is not constant along the world line of the 
photon. This is traditionally expressed using the fractional increase z of 
the associated wavelength A = 1/z/ = h/E: 

Definition 6.3.1. The redshift factor z of a photon originating at x 
and being detected at y is given by 



z{x,y) 



\{y) - X{x) 
A(x) 



If (t, :r), (t, y) G I X are the events occupied by two galaxies, then the 
distance of these events at time t is given by d((t, x), (t, y)) = a(i)di;(x, y), 
where d^{x^y) is the distance of x and y in (A7, y^). We will show 
that there is a constant H such that we have approximately z{x^ y) — 
Hd{{t^x), (t, y)) for galaxies which are distant enough for Hubble’s dis- 
covery to hold but still so close that it is sensible to linearise 2:. To 
this end we must first calculate the null geodesics in Robertson- Walker 
spacetimes. 

Lemma 6.3.1. Let (M,y) = ((t_,t+) x — dt^ -f- a^(t)ye) be an in- 

finitesimally isotropic Lorentzian manifold. The curve 



s h-> 7(s) (i(s),7(s)) 



is a geodesic if and only if 



0 + (i if) «(*)«' = 0, V^ + 2^i^ = 0 
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hold) where V denotes the induced covariant derivative on E, //7(s) = 
(t(s),7(5)) is a null geodesic then the conservation equation 



holds. 



dt 

a[t{s))— = const 
as 



Proof. The first part follows immediately from the corresponding formu- 
las for general warped products (Corollary 4.4.1). Assume now that 7 

is a null geodesic. From the first equation and ^7,7^ = (d:^/ds)^ we 
obtain 




= a'{t) 




dH 



Corollary 6.3.1* Let {M,g) = ((t_,t+) x -d^^ + a"^{t)ge) be an 
infinitesimally isotropic Lorentzian manifold. The curve s 7(5) = 
(t(s),7(s)) is a null geodesic if 

(i) r j(r) is a unit speed geodesic in {E^ge) 

(ii) and there is a constant c such that 




Proof. Assume that 7 is a unit speed geodesic and that the integral 
equations (ii) hold. The curve 7(5) = {t{s))^{s)) is a null curve because 
of 



(7((«)).7(s)) = -(di/ds)2 + 



d - d 
d7^>d7^ 
d-d 



r(dt/ds)^ 



= (dt/ds) ( -1 + ( — 7, ) ) = 0. 



It follows from the first integral equation in (ii) that adt/d$ is constant. 
Hence the first equation in Lemma 6.3.1 follows from 





= a'{t) 




+ a{t) 



dH 

ds2 



and the fact that 7 is a null curve. The second equation follows also by 
direct calculation: 






— 2 da d - _ — 2 da 
c^a^drdr"*^ a ds"^ 
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where we have used ^ 

as a as a-^c 

To see that all null geodesics can be described this way it is sufficient 
to notice that (up to a multiple) any null vector can be realised as 5^ + e 
where e is a unit vector tangent to L\ , I 



Proposition 6.3.1, Let {M^g) be infinitesimally isotropic and ^ be a 
photon which moves from x £ M to y e M. Then 

a{t{x)) ■ 

Proof. Prom Lemma 6.3.1 we get that is constant. Hence E = 

— {U,^) = ^ implies that A/a = h/{a{t)^) =: k is constant. Inserting 
A = fca in the definition of proves the claim. I 



Corollary 6.3.2. Let xq = (to,xo) 6 M. Then the frequencies emitted 
by nearby galaxies (situated at y — {t,y) £ M) appear to be red-shifted 
at xo by 

z w Hto d{{to,xh), {to,yh)), 

where — a'{to)/a{to) is the Hubble “constant” at xo- 

Proof We assume that ds{xo, yo) ^ 1 which in turn implies |< — to| ^ 1- 
Hence we obtain a{t) » o(to) + (t — to)a'{to) and therefore 

^ o(to) _ a(to) 

a(t) a(to) + (t - to)a'(io) 

" l + (t-to)Eto ■ ^ ^ 

Since the speed of light is 1 and t < to we have t~to — d((to,xh),(to,yo) 
which implies the assertion I 



6.4 The age of the universe and the big bang 

At our time to the Hubble constant Ilto = is positive. 

In the introduction of the preceding section we have given a heuristic 
argument which indicates that universe is expanding because of the red- 
shift of light emitted from nearby galaxies. In this section we we will show 
that the observation of Hubble implies that there has been a big bang (if 
infinitesimal isotropy holds). In order to do so we need an assumption 
which eliminates the unphysical cases that 
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— a is simply not defined on its maximal domain or that 

— a is not everywhere differentiable where it is defined. 

We will therefore assume that e,p: / — ^ M can be continuously extended, 
unless they become unbounded or a ^ 0. If this assumption would not 
hold then there would exist an extension (/ x ~dt^ of spacetime 
such that at d{I x E) C I x E matter would miraculously disappear or 
spring into existence. 

Theorem 6.4.1* Let (M,g) = x E, -dt^ + a^{t)ge) be an in- 

finitesimally isotropic, -maximally extended Lorentz manifold of di- 
mension n > 3. If 

(i) there is a > 0 

(ii) e, p are continuous on 
(Hi) e 4* yl/(87r) > 0, 

(iv) there exist constants c± such that e and p satisfy 

^ _ ^ P- A/{8h) ^ ^ 
n - 1 - € + yl/(87T) - +’ 

thent- > —00. In addition, we havelimt^t^ a{t) = 0 and limt_^t„ a'(t) = 
limt_>t_ a'^{t)(e{t) + A/(87r)) = co. For t^ there are the following possi- 
bilities. 



e 


u, 


lim a(t), 

£ — 


lim a'(t) 


lim a^(t)(e(t) + /1/(87 t)) 

t — 


-1 


00 


00 


1 


0 


0 


00 


oo 


0 


0 


1 


finite 


0 


-00 


00 



Proof If € can be extended beyond t- or as a bounded function so can 
p. For given e,p, Corollary 6.2.2 (i) can be viewed as a linear differential 
equation of second order for a. Consequently, if t± is finite, a could be 
extended as a function if it is not infinite (Dieudonne 1960, Remark 
10.4.6). This implies liminft->ti a{t) = 0 or limsup^^^^ a{t) = oo since 
{M,g) is -maximally extended by assumption. Conditions (iii) and 
(iv) imply that 87 t ^p -}- —f — 2i4/(n — 1) is positive Hence a"(t) < 0 
for all t by Corollary 6.2.2 (i). The function a' is therefore monotone and 
we can replace liminf (and limsup) by lim for a and a^ 

First we investigate what happens near t-. The inequality > 0 
implies the inequality a' (to) > 0. Since a"{t) < 0 for all t, the graph of 
a lies below the graph of the map t ^ a(to) 4 a'(to)(t - to)* This linear 
graph intersects the (a = 0)-axis at to - which implies that t_ 

is finite: t_ e [to - a(to)/a'(to), to). Since p - 4/(87 t) > c_(e -h T/(87 t)), 
there is a d > 0 with 
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e + p > (1 + c_)(e + ^/ (87t)) > (1 - — ^ ^ ^ *^ )(e + .l/( 87 r)) 

2 + (5 

Hence Corollary 6.2.2 (ii) implies (e + A/{%-k))' = -(n - l)(e + p)a'/a < 
— ( 2 +(J)(e+./l/ ( 87 r))a^/ a and therefore ((e+yl/( 87 r))(i^'*"^)^ < 0. The equa- 
tion limt^tj. o(f) = 0 implies now immediately that limt_>t_ o^(e(t) + 
yl/(87r)) = oo. Prom the equation for the energy density e in Corollary 
6.2.1 we infer that [a'Y diverges also. 

For t —> t+ there are several possibilities. If a has no mavimnm 
then limt_t^. a{t) = oo. a" < 0 implies that t+ = oo. Since ((e + 
A/ (87r))a^‘*‘^)' < 0, the function (e -I- A/ (8Tr))a^'^^ is decreasing which 
implies limt^(^(e(t) + +A/{S'K))a'^{t) = 0. The equation for the energy 
density e in Corollary 6.2.1 implies now that £ < 0. The assertions about 
li'^t^t+ ^'{i) follow from the same equation. Note that a must have a 
maximum if £ = 1. 

If a has a maximum at some ti € 8ire{ti) + A = ^(n-l)(n- 

2)e/a^{ti') whence £ = 1. Since <x"(ti) < 0 we have < 0 for some 
h € {h,t+) and we can — by time reversal — apply the same argument 
as for t t-. This proves limt^t+ a{t) = 0, limt^t^ a'(t) = -oo, and 
limt^t^ a^{t)(e{t) + A/{8 tc)) = oo. | 



Corollary 6,4.1. If our universe is described by a Robertson- Walker 
model without cosmological constant, then it is younger than 1 /Htg 

To obtain a numerical estimate note that according to measurements of 
the luminosity of stars (so-called “standard candles”) located at different 
distances one arrives at Hq = 1.7 • 10~^®s“^ (Wald 1984, p. 114) (see 
also (Weinberg 1972)). Since 1 year = 60 • 60 • 24 • 365 s we obtain that 
the universe should be younger than l/Htg = 2 • 10“ years. Wald (1984) 
also quotes experimental evidence which points to values for Hq which 
are twice as high. This would imply that the universe is less than half 
as old as indicated above. 

Part of Theorem 6.4.1 can be generalised to locally spatially homoge- 
neous universes which are not necessarily isotropic (Kendall 1994). With 
respect to the significance of the assumption d = 0 cf. Remark 5.3.3. 

Example 6 , 4 . 1, The inequality < c„ < is sharp. This 

can be seen from 3-dimensional dust spacetimes. We solve the system of 
differential equations (i), (ii) of Corollary 6.2.2. Both equations decouple 
since p = 0 and n = 3. Equation (ii) implies that there is a constant 
fc € R with e = ko? . Equation (i) reduces to — —Aa^ whence a(t) = 

. Prom our initial data ao,6o we obtain a^(0) = 
a/ (S yreo + A){ao)‘^ — e and therefore 
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, \/{8 7T eo + ^) (ao)^ - £ sm{VAt) 

a(t) = ao cos( V At) H 

VA 



for > 0, 



a{t) 



^ao A — y/—Ay/ (8 7T €o + A) (cto)^ ~ 



2A 



+ 



A + (8 7T eo + A) (ao)^ - e) e 

2^4 



for yl < 0, and 

a(t) - ao + \/S (ao)^Treo e t 

for id = 0, If e — 1, yl < 0, and the initial data ao, eo are chosen such 
that (eo -h yl) (ao)^ = 1 then we obtain solutions without singularities. 

Remark 6.4^ L Observe that the metric of Robertson- Walker spacetimes 
differs from the non- singular Lorentzian manifold 

((i_,t+) X r, ~dt^ + + sin2(0)d^2)^ 

only by an over all- factor a^. (We say that these pairs of spacetimes 
are conformally equivalent). This implies that there are past light cones 
which do not intersect. In particular, there are regions in the universe 
filled with particles which may never have had a chance to interact. 
This raises a serious problem. In physical theories, homogeneous states 
are usually obtained by the statistical description of microscopic states. 
In particular, homogeneity is always a result of prolonged interaction. 
If there are regions in the universe which cannot have interacted with 
each other in the past, then we need a new explanation why they are 
so similar that we can describe the universe as homogeneous. Physicists 
are currently trying to find an answer to this question by arguing that 
during the early phase of the universe the expansion was much faster 
than would be plausible if one considers ordinary matter. Under this 
assumption, these regions would have had a chance to interact after all. 
However, the physicists favouring this inflational universe are still^ very 
far from a satisfactory physical explanation for the occurence of this 
“inflationary” phase in the history of the universe. 



4 



I write this in 1998. 
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6.5 A simple model for the universe we live in 

In this subsection we will discuss two especially simple cases in which 
Einstein’s equation can be explicitly solved. They are of special impor- 
tance since they are in accoidaixcc with qualitative expectations of the 
development of the universe. At present, there seem to be two principal 
types of matter which fill the universe. To simplify matters we may as- 
sume that today most of the matter consists of galaxies^. Galaxies do not 
move much relative to each other and are too far apart from each other to 
interact. It is therefore a good description to model them by pressure-less 
dust. The universe is also filled with radiation which was the dominat- 
ing form of matter during the early stages of the Universe. This 'cosmic 
microwave background radiation has been discovered by Penzias and 
Wilson (1965) The most important feature of this microwave back- 
ground radiation is that it is (almost) completely isotropic and therefore 
cannot be explained by a confined source which is located somewhere in 
the universe. The radiation is now very weak (having a temperature of 
about 2.7 Kelvin) and its energy is completely dominated by the energy 
contribution from the galaxies. The microwave background radiation can 
be described by a photon gas. The energy momentum tensor of a pho- 
ton gas is traceless by Lemma 5.2.3 which, together with infinitesimal 
isotropy, implies c = (n ~ l)p. 

We will not consider the composed system which consists of dust 
and radiation but only the much simpler cases where we have pure dust 
or pure radiation. At the end of the section we will give a heuristic 
justification. 

Lemma 6.5.1. Let (M,g) == ((f_,4) x E,-dt^ -t- a^{t)g^) be an in- 
finitesimally isotropic Lorentzian manifold of dimension n > 3 and as- 
sume that the function a is non-constant. Then the following statements 
are equivalent 

(i) p = 0 

(ii) — m is constant, 

(Hi) (n - l)(n ™ 2) ((a')^ -he) - 4- 2A, 

Proof The equation p = 0 is equivalent to (n - l)a' / a P/e = 0 because 
of Corollary 6.2.2 and a' / 0. An integration shows that this equation 
is equivalent to — m for some constant m. The equivalence of (ii) 
and (hi) is clear from the formula for e in Lemma 6.2,1. I 

® However, it is expected that most of the matter is “dark” which is an eu- 
phemism for “we cannot directly observe it and do not know anything about 
it”, 

® The discoverers where concerned with the development of a new satellite 
communication system and found that their new high precision antenna 
seemed to have an unexplainable background noise. 
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In the physically interesting case n = 4, A = 0, the differential equations 
in Lemma 6.5.1 (iii) can be solved explicitly and their solutions are given 

by 



e 


a 


t 


-1 

0 

1 


50(1 — cosh(i9)) 

(9c/4)^/^i2/3 

5c(l - cos(i?)) 


- sinh(i?)) 
ic(i? - sin(i?)) 



where c is a constant of integration. 

Lemma 6.5.2. Let {M,g) = x + a^{t)ge) be an in- 

finitesimally isotropic Lorentzian manifold of dimension n > 3 and as- 
sume that the function a is non- constant Then the following statements 
are equivalent 

(i) e = (n - l)p 

(ii) eoP' ~ m is constant^ 

(iii) {n - l){n - 2) ((aO^ + e) = IGTrma^-’^ + 2/1. 

Proof The equation e = (n - 1) is equivalent to (n - 1)^ + === 0 

because of Corollary 6.2.2 anda' / 0. Integrating this equation we obtain 
ea'^ = m for some constant m. The equivalence of (ii) and (iii) is clear 
from the formula for e in Lemma 6.2.1. I 

In the case n — 4, A — 0 the resulting differential equations can be solved 
explicitly and the solutions are given by 



6 


1 1 — >■ a{t) 


-1 

0 

1 


c\j-\ + (1 + tjsr 
(4c2)i/4Vf 

c-y/l - (1 - tjcT 



where c is a constant of integration. 

We will make the assumption that the interaction between both types 
of matter is negligible. Observe that due to the formulas (ii) in Lemmas 
6.5.1 and 6.5.2, radiation dominates at early times (a C 1) and dust 
dominates at late times ((a > 1). Hence it seems to be a good approx- 
imation to use the radiation model for the early universe and the dust 
model for the present universe. 






7. Spherical symmetry 



This chapter serves two purposes. Firstly, a large isometry group simpli- 
fies the problem of solving Einstein’s equation considerably. Virtually all 
explicitly known solutions of Einstein’s equations for physically plausi- 
ble matter fields have a high degree of symmetry. Secondly, a spherically 
symmetric spacetimes are very good descriptions of non-rotating, iso- 
lated stars and therefore of astrophysical interest. (If the star rotates 
the rotation axis breaks the symmetry). In Sect. 7.2 we will see that 
there is a unique 1-parameter family of spherically symmetric solutions 
to Einsteins’s equation for vacuum with vanishing cosmological constant. 
The parameter can be interpreted as the mass of the isolated star. If the 
mass of the sun is chosen, one obtains an excellent model of the gravi- 
tational field in our solar system. Some aspects of this model have been 
verified experimentally. These solutions also form the basis for much of 
our intuition of black holes. 

In this chapter we will also discuss the initial value problem for the 
case that the energy momentum tensor represents a perfect fluid to- 
gether with a non-interacting electric field (cf. Sect. 7.4). While in this 
more general case we will not obtain explicit solutions we will neverthe- 
less arrive at a non-trivial existence theorem for the considered class of 
spacetimes. This section will hardly be of primary interest to a geomet- 
rically oriented reader. Since the discussion uses elements of the theory 
of systems of hyperbolic partial differential equations even physically 
oriented readers may wish to skip the proofs on first reading. 

The validity of the physical conclusions from this (and also the fol- 
lowing) chapter depends very much on the question of whether the cor- 
responding properties of our explicit solutions are stable under perturba- 
tions. We know only very little about the stability of Einstein’s equations. 
Since they are highly non-linear it is well possible that these properties 
have little to do with our actual universe which has only “approximate 
isometries” . 
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7. Spherical symmetry 



7.1 Pseudo-Riemannian manifolds 
with spherical symmetry 

A property in is spherically symmetric if it is invariant under rotations 

about the origin. The rotational isometries defined below form a group 
which is (locally) isometric to the rotation group SO(3). We take this as 
our main justification of the following definition. 

Definition 7.1.1. A pseudo- Riemannian manifold (M,g) is called 
spherically symmetric if it has a dense open subset M° such that (M°,g) 
can locally be written as a warped product {S x S'^^gs + r^dl?^), where 
dJ?2 is the metric of the 2- dimensional unit sphere, (U,gs) an (n — 2)~ 
dimensional pseudo- Riemannian manifold, and r: E ^ R a positive 
function. 

The sets {x} x (x ^ E) are called spheres of symmetry and those 
isometries which map all spheres of symmetry into themselves are called 
rotational isometries. The set C ~ M \ is the centre of symmetry. 

Lemma 7.1.1* Let (M,g) be a spherically symmetric spacetime and 
{E,gz:) be the 2-dimensional Lorentzian manifold orthogonal to the sphe- 
res of symmetry. For each frame {U, Q} of (E,gs) with 

{Q,Q) = 1, 

there are adapted coordinates with respect to which 

g ~ -I- sin^^d^?^) 

and U = e~^dt, Q ~ e~^dq. 

In these coordinates the energy momentum tensor T is given by 

87rr({/, C/) = 1 (1 + ([/ . rf - (Q . r)^) 

+ ^ (-Q • <5 • r + (C/ • X)(U • r)) - A, 

8nT(U, Q) = ~{U*Qr-{Q»u){U* r)) , 

87tT(Q, Q) = -1 (1 + (t/ • r)2 - (Q . r)2) 

+ ^ (-C/ • 1/ *r + (Q . t/)(Q .r)) + A, 

87rr(V; X) = 0 for all X eTE andV € TE-^, 

87tT(^, = 87rT(;r^^, 

= Q»Q*u-i-{Q»uf -U*U»\-{U»Xf 
+ — U • U • r Q • Q • r — {Q • t')(<5 • r) 
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-(U.X){U»r)) + A, 

Proof. The existence of the adapted coordinates (t, 0, (p) is clear from 

Corollary 2.4.2. Hence we only need to calculate SttT = Ric — |Scal g + 
Ag. By Lemma 4.4.15 we have for X, Y tangent to E and y, W orthogonal 
to E 

Ric(X,y) = Iscalr {X,Y) - -VVr{X,Y), 

A r 

Ric(X, y) = 0 

1 ( /Yr 1 \ 

Ric(y, w) = ^ (y, Ty) - f — + - (grad(r), grad(r)) j (F, W ) , 

2 4 2 

Seal = Seals ^ ^ (grad(r), grad(r)) . 

Sinee VVr(X, Y)=X»Y»r - X »r we obtain 
VVr(C/, U) = U»U*r-{Q» v){Q • r), 

Wr{U,Q) = U »Q »r - iQ •u){U »r) = Q »U *r - {U • X)iQ »r), 

VVr(Q, Q) = <5 • Q • r - ([/ • A)(17 • r) 

whieh in turn implies Ar = —U»U»r + Q»Q»r+{Q»u)(Q»r) — {U • 
A) (17 • r). Henee we get 

87tT(17, U) = -Iseals --{U»U»r-{Q» v){Q • r)) + 1 (Seals 
+ ^ - ^{-U .17.r + Q.(3.r + (Q. v){Q • r)- 
(t/ • \){U • r)) - ^(-(17 • rf + {Q* rf)) - A 

-^(Q*Q*r-(t/.A)(17.r)) -yl, 

87rT(C/, Q) = -^ (17 • Q . r- - (<5 • i/)(17 • r)) , 

87rT(<3, Q) = iseals - ^ (Q • Q • r - (1/ • A)(17 • r)) - 1 (Seals 
2 4 

+ ^ - -(-17 *17»r + (3«(5«r + (Q* j/)(<3 • r)- 
(U . A)(C/ . r)) - ^{-{U • rf + (Q • r)^)) + A 
= -l(l + (t/.r)2-(g.r)2) 
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- ^{U *U *r - {Q • iy)(Q •r)) + A. 

Let X G TxX and V € Since Ric(l/, X) = 0 and {X,V) — 0 

it is clear that %tcT[X^V) = 0. Since there is an isometry which maps 
T into we have 87 tT(^,^) = The Ricci 

tensor restricted to is a multiple of the metric which implies 

T(^, Finally, we the last component of T which needs to 

be calculated is given by 

87rT(^,^) = ;l- (l(-t/*C/.r + Q*Q.r + (Q*i/)(Q-r) 

- {U . \){U • r-)) + 1 ( - ([/ . r)2 + (Q . rf)) 

- hscals + - -{-U •U •r + Q^Q^r 

2 ' r 

+ (Q • i^){Q •r) - {U • \){U • r)) 

-^(-(^•r)^ + (Q»r)^))+A 

= l(_I/*J7.r + Q*<5*r 

r ^ 

+ (Q*i^)(QT)-(t/.A)(C/.r)) 

-([/•[/•A + (C/*A)2-Q*g.i/-(Q. j/)2) + A, 

where in the last equation we have used 

Seals = 2(t/ • [/ • A + ([/ • A)2 - Q • Q • 1/ - (Q • i/)^) 

(cf. Proposition 4.3.5). I 

We will now re-arrange these equations in a form which is more practical 
if one wants to solve them. We will not do this in complete generality but 
rather assume the following genericity assumption on the matter model. 
Recall that for any normalised timelike vector v the number T(v,v) 
represents the energy density measured by u. This number should be 
positive. For null vectors N we obtain then T(iV, X) > 0 by continuity. 
Given a non- extreme matter distribution it is therefore plausible to ex- 
pect T(iV, X) > 0 for all null vectors N, In this case it is always possible 
to diagonalise g and T simultaneously (Greub 1981, Chapter IX §3). 

Lemma 7,1.2. Let {M,g) be a spherically symmetric Lorentzian man- 
ifold and T be a symmetric ( 2 ) tensor field which is also spherically 
symmetric. Assume that T and g can be simultaneously diagonalised. 
Then there exist coordinates (t,g, V?) such that 

g = g2A(t,g)^^2 ^2^^^ ^^^^^2 .p gin^ ^ d(/?^), (7.1.1) 
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T = eU^ + PradQ'" ® + Psph‘>'^{t,q){d9^ + siri^ 0 d(p^), (7.1.2) 



where U := e~‘'dt and Q := e~^dq are invariantly defined if e^ — Prad- 
Moreover, Einstein’s equation, Ric — Scal/2 g + Ag, = SirT is equiva- 
lent to the system of differential equations 



1 + {U »rff ~{Q*rf 
2r 



Q»Q»r = {U*r) (C/*A) + 

- 47rr(e + ^), 

Q»U •r = {Q*r) {U*X), 

U •U •r ~ {Q •r){Q •}/) — 

- 47rr(prad - 

[/ • C7 • A = -(C/ • A)2 + Q • g . 1/ + (g . z.)2 + '^ + {U*rff ~{Q»rf 



1 + (C7 »r)^ — {Q • r)^ 



2r 



(7.1.3) 

(7.1.4) 

(7.1.5) 



4 ar{e - prad + 2 psph), 



(7.1.6) 



and the equation of motion, div(T) = 0, is equivalent to 

U »r 



U •e = -{e + pra.d)U • A-2(e + psph)- 

/ 

Q • Prad — ~(c A Prad)Q * V — 2(prad — Psph) 



g *r 



(7.1.7) 

(7.1.8) 



Proof. Lemma 7.1.1 implies that T satisfies 

- T{X, V) = 0 for all X e V € (TxM)-^ 

- and T{V, V) = T(W, W) for all unit vectors V,W € (xE)^. 

Since by assumption there is a firame which diagonalises the energy mo- 
mentum tensor T and the metric g simultaneously there must exist vector 
fields Q,U tangent to S and functions r, e,prad.Psph: X — > M such that 



5 = -17^ ® [7^ + g'’ ® gb + r^dn^, 

T = eU^®U^+ PradQ^ O + Psphr^d^2^. 



The existence of adapted coordinates follows now from Corollary 2.4.2. 
That Einstein’s equations are equivalent to Equations (7.1.3)-(7.1.6) fol- 
lows immediately from the definitions of e,prad>Psph and Lemma 7.1.1. 
Observe that for any vector fields X,Y tangent to S the decomposi- 
tion vector fields V, W tangent to the 

spheres of symmetry the decomposition VyW — (V, VK) grad(r) + 

^ VyW holds. Using Proposition 4.3.5 we obtain 



div(T) = (de(C/) -1- €div(t/))t/' + e{VuUf + (dp.ad(Q) + Pr^div{Q))Q^ 
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+ Prad(Vg<5)'’ + (]?sphdiv(-a0)) r 

+ Psph + Psphdiv(— d^)r sm{9)dip 

"i~ Psph ( V 1. ^ : — 77i\^^ 

\ rsin{0)^^rsin{6) ^ 



= (de(C/) + e((C/ . A) + -([/ • r)) + prad(t/ • A))C/'’ 

+ (d^>rad(<3) +Prad((<5 • I^) + ^(Q • r)) + e(Q • iy))Q^ 



“1“ T^sph ^ 



/cos( 0 ) 



1 



. , . A6 - -dr + 0 + 0 - Idr - -r-^d6>') 
sin(^) r r sin(0) / 



= (d€(C/) + e{{U . A) + 2(t/ • r)) +p,^[U . A) + 



(dprad(<3) + Prad ((<5 • ^^) + “ (<5 • »")) + e{Q • v) 



+ 

” 2psph 



where we have used Proposition 4.3.5 to calculate 



du: 



div( 0 ,) = dr 2 ^e V 5 a,, a,) + v^a,, 

^ sin(^) Sin(^) 



cos( 0 ) 

sin( 0 ) ’ 



sin(^) ’ 



The functions C/ • A and Q are well defined invariants since the com- 
mutator of U and Q is given by [[/, Q] = (Q • iy)U — (U • X) Q. 

For spherically symmetric spacetimes we can define an invariant func- 
tion which can be interpreted as a mass. 

Definition 7,1»2. Let (M,p) be a spherically symmetric spacetime. 
Then its mass function m is defined by m:= ~{1 - (grad(r), grad(r))) 

The term “mass function” can be motivated under the additional as- 
sumptions that there exists a centre of symmetry, that Q •r > 0 , and 
that it is possible to diagonalise g and T simultaneously. Recall from 
special relativity that mass and energy are equivalent concepts and that 
the energy of a point particle measured by an observer depends on this 
observer. 
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In order to determine the mass of a material object consisting of 
particles in spacetime we would first fix a spacelike hypersurface rep- 
resenting an instant of time. The mass of each particle which intersects 
this hypersurface would be measured by the infinitesimal observer repre- 
sented by the normal of the hypersurface (cf. Sect. 1.4.3). The sum over 
all these numbers is then the mass of the material object with respect 
to the chosen hypersurface. 

Since we assume that we can simultaneously diagonalise g and T there 
must be a timelike eigenvector of the linear map Observe that 

it is unique if e 7^ "Prad- In this case it is orthogonal to the spacelike 
hypersurfaces t = const. In other words, the hypersurfaces t = const 
are invariantly defined. This indicates that we should use this family of 
hypersurfaces in order to define mass. 

Multiplying Equation (7.1.3) with Qmr and inserting Equation (7.1.4) 
we obtain 87r(e + • r = Q • (r(l + {U • r)^ ~ {Q • r)^)) —2Q^m 

and therefore 




where t = to is fixed. 



(7.1.9) 



This integral can also be written as a volume integral, 




sin(^) dr A d0 A dip 
sin(0)((5 • r)e“^ dg A d0 A dtp 






where B is the ball {x e M : t{x) ~ to, q{x) < qq}. Equations Q^r = 1 
and A = 0 would imply that we had just an integral over the energy 
density e which — in view of E = mc^ — can also be interpreted as a 
mass density. If, in addition, Prad = Psph = 0 then T represents a smooth 
3-parameter family of freely falling particles and we would obtain the 
smooth analogue to the motivation using individual particles above. 

In general, however, Q^r ^1. This reflects that one also has to take 
into account the energy contribution of the gravitational field. 

Lemma 7*1*3* Letm be the mass function of{M,g), Then 
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Proof. We have U^Q^r = Q^U^r-\~[U,Q]^r = Q •U •r + (Q 0 i/){U • 
r) - {U • X){Q • r). By Lemma 7.1.2 we can calculate 

Q • m = Q . ( I (1 + (t/ . r)2 - (Q . rf)) = 

i r 

+ r {yQ •U •r -{Q* r)Q • Q • r) 

= + r({U • r){Q • r)U • A - (Q • r){U • r)U • A 

- (Q • r)rn + 4nr{Q * r) ) 

and 



U 0V 

U mm— ^ m + r {{U • r)U 0 U 0 v ~ {Q 0 r)U 0 Q 0 v) 
XJ 0 T 

— m 4- r((f/ • r){Q 0 r)Q 0v — [U 0 r)m 



- 4 :TTr{U 0 r) ^ j ~ {Q0V)Q0U 0T 

~ {Q 0 r){U .r)Q0v + {Q0 r)^U 0 A)) 

^ + r (^-m{U • r) - 47rr(?7 • r) ^prad - 



= -A'Kr^iU • r) ^pfad - • 



While orthogonal coordinates are often very useful double null coordi- 
nates introduced below are much better adapted to the geometry of 
spherically symmetric spacetimes. 

Lemma 7.1.4 (double null coordinates). Let (M^g) be a 4-dimen- 
sional spherically symmetric Lorentz manifold. Then there exist local co- 
ordinates {u,v^8,ip) and functions G: (u^v) ^ F{ujv) eR, r: (u,v) 
r(u, v) eR such that 

g ~ G(u,u)dudu + r^{u,v) (d0^ 4- sin^(^)d(y 9 ^) . 

The function G is unique up to transformations of the form u h- > u{u) 
V 1 -^ v{v) and interchanging of the coordinates. 

In these coordinates^ the Christoffel symbols are given by 

r„\ = a„(inr), r% = r:^^ = du{\nG\ 




7.2 The Schwarzschild solution 315 



r-„ = a.(inr), r% = r^^ = d,{]nG), r,*-, = -i— 

sm [Oj G 

"" sS(^’ cosC*?) 

Proo/ Since in a two-dimensional Lorentz manifold there exist for each 
point exactly two linearly independent, lightlike directions, the existence 
assertion follows immediately from Definition 7.1.1 and Corollary 2.4.2. 

Let Uy V be coordinates and G(u^v)^ r(u,v) be coordinates with 
g = G{u, v)dMv~\-f'^{u, v) (d^^ + siv? {6)d(p^) , Since the warped product 
is invariantly defined we have G{u, u)dwd^; — G{u, v)dudv. At each point 
of a 2-dimensional Lorentzian manifold there are exactly two null direc- 
tions whence we can assume (without loss of generality^) that there exist 
functions fu, fv with 5^ = fudu and 9^) = fvdy. Since the commutator 
of Gaufiian vector fields vanishes we obtain 



0 = [fudujvdv] == fu{dufv)dy - fv(dvfu)du 

and therefore dufv ~ dvfu = 0. This implies u - / fu{u)du and v = 
/ fr,{v)du. 

It is straightforward to calculate the Christoffel symbols using the 
formula == \g’^'^{db9dc + dcQbd - ddQbc)- ■ 



Remark 7J,1. The function r : M R gives the area of the orbits via 
the equation Area(5a;) = 47rr^ and is therefore invariantly defined. That 
G is almost an invariant is one of the two main reasons why double null 
coordinates (u, v) are a very practical choice. The other reason is that in 
these coordinates the causal structure of (M, g) is explicitly described. 



7.2 The Schwarzschild solution 

In this section we will solve Einstein’s equation for the case of a spher- 
ically symmetric vacuum spacetime. These solutions describe the grav- 
itational field caused by a single non-rotating star which is situated in 
empty space. As the sun rotates rather slowly and space is almost empty, 
these solutions describe gravitation in the solar system very well. 

Theorem 7.2.1 (Birkhoff). Let (M, g) be a spherically symmetric vac- 
uum spacetime. Then either r = ^ is constant and the mass function is 
given bym — l/(2\/Z) or there is a constant mo and a dense^ open subset 
M° C M such that each x G M° admits a local coordinate neighbourhood 
with local coordinates (t, r, 0) which satisfy 



^ Otherwise we exchange u and v. 
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9=~ 





dr + 



dr^ 



(1 



2mo _ 



+ (d^^ + sm^{9)d(p‘^) . 



Proo/ Lemma 7.1.3 implies 



U • m ~ U • 




and Q • m = Q • 




Hence there is a constant ?no such that m = mo+^. If (Q«r)^ = 

in an open set then from the definition of m we get | — mo + ^ which 
in turn implies that r = const. Since T — 0 we can choose coordinates 
which simultaneously diagonalise g and T, Equation (7.1.3) implies then 

= 1/yl and we obtain m = r/2 ~ l/(2-\/Z). 

Let us now assume that g^{dr, dr) = “([/ • r)^ + {Q • r)^ / 0. There 
are orthogonal coordinates (g, i) such that r ^ q. Since T = 0 this choice 
of coordinates trivially diagonalises g and T simultaneously and we can 
assume without loss of generality that in Lemma 7.1.2 we have t — i, 
q ~ q. We immediately obtain U • r — e~^dtq = 0. Hence Equation 
(7,1.4) yields U •X — Q and Equation (7.1.3) implies 



QmQmr ^ 



m 

^2 



At 

~2 



mo Ar 

r2 3 



Since Q — e ^dr this equation is equivalent to 



e ‘^^drX “ 



mo At 



and can be integrated to give 



. ^(1) _ ?!I!2 _ = A{t) - 

To T 



where A{t) is an integration constant. Equation (7.1.5) implies e“^^9y.i/ = 
^ and therefore e“^^9r^ ~ which in turn yields u == 

B{t) — A. After a re-parameterisation of t we can choose P = 0. m — 
|(1 - (Q • r)^) = |(1 - e“^^) = |(1 - A + 2m/r) implies A{t) = 1 and 
the assertion is proved. I 



Observe that any spherically symmetric vaccum spacetime is automati- 
cally static in the region ^^(dr, dr) > 0. In the region ^^(dr, dr) < 0 it is 
not static but has a fourth spacelike Killing vector field. 

This spacetime has (for A = 0) first been obtained by Schwarzschild 
(1916) who solved the static, spherically symmetric vacuum equation. 
Birkhoff then showed that staticity was not needed as an assumption. 
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Definition 7.2.1. A spherically symmetric vacuum spacetime with van- 
ishing cosmological constant (M, y) is called a Schwarzschild spacetime^. 
The coordinates {t,r^y)^0) are called Schwarzschild coordinates. 

In the rest of this section we will assume A ~0. 

The regions r < 2m and r > 2m cannot be matched naively using 
these coordinates (cf. Fig. 7.2.1 which represents the causal structure), 
and for some time it has been believed that there is a physical singularity 
at r = 2m. Below we will geometrically determine more useful double 
null coordinates of the solution. This will show that r = 2m is a spurious 
singularity and that there exists a unique, inextensible solution of the 
spherically symmetric vacuum equation. 




Fig. 7.2,1. Schwarzschild spacetime in Schwarzschild coordinates 



Proposition 7.2.1. With 

f:R+\ {0} ^ (- 00 , 1), r /(r) = -^ (^1 - ^) 

F: E+ \ {0} ^ R+ \ {0} r F{r) = , 

r 

let Bschw = ; xy < 1} andge^^,^ = Fof~^XY) dXdY. 

Then the Lorentzian manifold 

X + {f~'^{XY)f {de'^ + sin'^{e)d(p^)) 

satisfies Ric = 0. The coordinates (t,r) of Theorem 7.2.1 and (X,T) are 
related through 

= 

^ In the literature this name is usually reserved for a subset of the maximally 
extended Schwarzschild spacetime, the shaded region in Fig. 7.2.2. 
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Proof. We can restrict to the base manifold Bschw For any null vector 
field N which is Uf r of symmetry the equations 

™(l“2m/r)(iV^)’'^ + (l-2m/r)~^(iV^)^ = 0 holds, whence we have — 
±(1 — 2m/r)”^A^^, Double null coordinates can now be obtained through 
an integration of these two vector fields. Since /J^(l ~ 2m/r)"^dr = 
r + 2m In (^ - l) we define our coordinates by 

i- = i+(r + 2ml„(^-l)), r.(-(r + 2ml„(^-l)). 

This gives dXdF = (dt + (1 - 2m/r)“Mr)(dt - (1 - 2m/r)“Mr) = 
df^-(l-^)"^dr^ and therefore = (l-^)dXdy. From X-F = 
2(r + 2 mln( 2 ^ — 1)) we obtain 

g(X-y)/(4m) _ gr/(2m) ( S I'l = JL.Qr/(.2m) A _ _ _f(„\ 

V 2m J 2 m \ r J ' 

which implies 



PB.ehw = (l - v) ^ 



We set X = Y ~ — and finally obtain 



3 Bschw 



32m^e-’'/(2'”) 



r 



dXdF, 



m = XY. 



Furthermore, dx and dy are both future oriented (this has been the 
reason for choosing the minus sign in the coordinate transformation for 
Y). It remains to show that the inverse of / exists for all r > 0. But this 
follows immediately from f(r) = “ 4 ^^^ <0. I 



The coordinates provided by Proposition 7.2.1 are called Kruskal- 
Szekeres- coordinates and the corresponding spacetime is often called 
Kruskal-Szekeres- spacetime. This spacetime is locally isometric to the 
metric given in Theorem 7.2.1 but the global structure is different 
from the global structure obtained by using Schwarschild coordinates 
(cf. Fig. 7.2.2). Nevertheless, in this book we will refer to the inextensi- 
ble spacetime given in Proposition 7.2.1 as Schwarzschild spacetime. 

Remark 7.2.1. The motivation for the Schwarzschild spacetime is to de- 
scribe the exterior of a non-rotating star. During the lifetime of the star 
the radius may change (typically, it may shrink and perhaps even reach 
0). If we denote the r-component of the star at t by rgtar(i^) then we 
will need for each t only the part r > rgtar(^) of the shaded region in 
Fig. 7.2.2. The white region can be completely discarded for physical 
purposes. 




7.2 The Schwarzschild solution 319 




Fig. 7.2.2. Schwarzschild spacetime. Radial null geodesics are the 
straight lines X = const and Y = const. The region covered by 
Schwarzschild coordinates is shaded 



We will now investigate this solution in more detail and get a first 
glimpse at what is known as a black hole: In Proposition 7.2,3 below we 
will show that this spacetime is singular but inextensible. But first we 
need to calculate its geodesics. 

Lemma 7.2.1. Let (M^g) be a pseudo- Riemannian manifold and 
. . . , be coordinates such that gab 'Is diagonal^ 

n 

gabdx°'dx'’ = 

0=1 

Then the geodesics si->7(s) of (M,g) are given by 




Proof. We suspend the summation convention if the repeated index is a. 
Then we have gab = ga^b obtain 

(v^7, (7, da) - ( 7 , = £ {gaT) - 7 V (^6, Va/a) 

= ^ (ffo7“) - (dcgaSb + da9bc - dbgJa) 
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= ^ ^ {9aT) - \ {i^fdagh. 



Proposition 7.2.2. Let s 7 (^) ^ geodesic in Schwarzschild space- 

time 

(®2 X 52,5b....w + rH^Y) (d^2 + Sin2(^)d¥>2)) 

mth (7,7) = 7] e {- 1 , 0 , 1 } and assume that 7 ( 0 ) ^ (a: : r(x) = 2 m}. 
Then there exists a rotational isometry <p such that j — (po^ i$ given by 




where E, L are constants. 

Proof, We will use the coordinates provided by Theorem 7.2.1. Since 
the metric is diagonal we can apply Lemma 7.2.1 and obtain for the 
(t, 9 , 9?)-components of 7 



d // 2m\ dt\ 

ds \ \ r ) d$) 



= 0 , 



ds 







= 0 , 






There is an rotational isometry 4>: x x such that 9 o <j) o 7(0) = tt/ 2 
and d^(^^7(0)) — 0. Then 0 o 7 = tt/2 is the unique solution of the 
last equation. The first two equations can immediately be integrated. To 
derive the fourth equation in the assertion of the proposition it is more 
convenient to use the conservation property (7,7) = 77 than to use the 
r-component of the geodesic equation in Lemma 7.2.1. In fact, it follows 
directly from 




after inserting the equations for dt/ds and d<p/ds, I 



Lemma 7.2.2. Let {M^g) be a spacetime which is locally extensible. 
Then there is a null geodesic in (M, g) which is incomplete and extensible. 
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Proof, Let (M, g) be a local extension of (M, g), x e M\ M, and y 6 M. 
Then there is a (not necessarily future or time oriented) broken null 
geodesic from x to y. This broken geodesic 7 must intersect dM C M in 
a point = 7(0). Without loss of generality we can assume that 7(f) € M 
for t < 0, \t\ sufficiently small. Since z ^ M this geodesic is incomplete 
and extensible. i 

Proposition 7 * 2 * 3 . Schwarzschild spacetime is inextensible and geodesi- 
cally incomplete, A future directed null geodesic is incomplete if and only 
if it enters the region {1 - 2m/r < 0}, It then approaches r = 0 and the 
Kretschmann scalar given by Rated = 48 m^/r^ diverges along this 
curve. 

Proof, Lemma 4 . 4.14 implies Rrdrd = — 27^-r ' = 

-2^, R0cpO<p = 2rmsin^(0), Rotot = = “^(2m ~ r), and 

Rated = 0 ioT all other components which are not related to these com- 
ponents by the general symmetries of the Riemann tensor. It follows 
that R^^^^Ratcd - 48 m^/r^ and therefore that any curve 7(5) with 
r o 7(5) ^ 0 is inextensible. A curve 7 = (7Bschw ? 752 ) in ^schw x 
is extensible if and only if 7 Bsci.w is extensible in ^schw and 752 is exten- 
sible in S^, By Lemma 7 . 2.2 we only have to study null geodesics in order 
to prove that {M^g) is inextensible. If yB^euw i® extensible then dr/ds 
is bounded by Proposition 7 . 2.2 and r 0 . By Corollary 4 . 4.1 752 is a 
pregeodesic with bounded acceleration in a compact manifold and there- 
fore also extensible. Hence we can restrict to (Bschw,^?s«chw) study 
null geodesics in this 2-dimensional spacetime. In Kruskal coordinates 
(A, Y) these geodesics are given by A = const or T — const. Because of 
the reflection isometries (A, y) (T, A) and (A, T) 1-^ (-A, -1") we 
only need to consider future directed geodesics of the form X = const, 
y > 0 . The region y > 0 is the disjoint union of three different subsets, 

(i) r/2m > 1, (ii) r/2m = 1, (hi) r/2m < 1, 

each of them being invariant under future directed null geodesics X = 
const, y > 0, 

We have to estimate the affine parameter of our null geodesics. If 7 
is a null geodesic given by X = const, y > 0 then there is a function 
y 1—^ h{Y) with 7 = h{Y)dy and V/^^y^g^(/i(y) 9 v) = h{Y){h'{Y)dY A- 

h{Y)V0ydY) = h{Y){h'{Y) + h{Y)r^y)dv. Prom = dyHdXY), 

we obtain therefore h{Y) — c{gxY)~^, where c is a constant. 

In region (ii) gxY is constant which implies that 7 satisfies -^Y o 
7(5) = h{Y) — const and is therefore future complete. 

Now consider regions (i), (iii). Since 



dr/ds = a/£:2 „ L2/^2 (1 „ 2m/r) ^ |E| > 0 
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for r 00 the parameter s diverges if and only if r diverges. 

In case (i) we have (1 - 2m/r) < 0 and the square root is well defined 
for all s. The equation 

dF/dr = h{y) ds/dr = (i _ 2mlr))~^'^ 

implies that r diverges if Y diverges^. Hence s diverges for T oo and 
the geodesic 7 must be future complete. 

In case (hi) it is clear from XY — f{r) = -^ (l - that 

our future directed null geodesics X ~ const, T > 0 are approaching 
r = 0 and are therefore inextensible and incomplete. I 



The region r < 2m, X > 0 is the simplest model of a black hole. A 
black hole is loosely characterised by the fact that a light ray which 
enters it cannot leave it any more but instead reaches the edge of the 
universe before the aflBne parameter of the corresponding null geodesic 
has reached the value 00.^ Since a black hole does not emit a single light 
ray one is tempted to say that it is black, whence the name coined by 
J. A, Wheeler. However, this name is slightly misleading, since the black 
hole is not in the past of any observer who is situated outside this region. 
Rather than appearing black it is simply invisible. 

An observer who enters the region does not have a very low life ex- 
pectancy. The longest timelike curve within the black hole region is given 
by A = y, X G [0, 1]. In Schwarzschild coordinates this corresponds to 
the path = 0, r G (0, 2m). Hence the observer’s life is bounded by 



rO rO I 

^S= \/-5BHohw dr = / sj^mjr - 1 dr = 

J 2m J 2m 



Ttm, 



7.2*1 Experimental tests for the Schwarzschild solution 

In this section we will investigate the region 2m/r < 1 which may be 
considered as the exterior of a non-rotating, spherically symmetric star 
of mass m. The discussion applies in particular to the gravitational field 
produced by the sun which was Schwarzschild’s motivation for solving 
Einstein’s equation in this special case. 

^ This property could also have been seen geometrically: The lines X = 
const < 0, y >0 intersect all the hyperbolas r = const > 2m. 

A widely accepted general definition of black holes does not exist. The 
definition we have just given has the disadvantage that any Robertson- 
Walker solution which satisfies the assumptions of Theorem 6.4.1 and e = 1 
is a giant black hole. In this special case one would have to replace the 
condition that the null geodesics in the black hole don’t reach the affine 
parameter oo by the condition that they don’t end in the cosmological 
future singularity given hy t 
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Since the exterior region contains the timelike Killing field dt orthog- 
onal to the spheres of symmetry it admits a natural infinitesimal split 

0(9t)-^ of spacetime into space and time. Moreover, the distribution 
is integrable whence we obtain geometrically defined hypersurfaces 
of constant time. These hypersurfaces are given by 

^to = r,e,ip)\ t = to, r > 2 m}. 

Since dt is a Killing vector field the pullback of the metric to St does 
not depend on t and we can identify all St through projection along the 
t coordinate. A timelike curve in spacetime corresponds to a curve in 
the Riemannian manifold (Z'o, (1 - 2m/r)“^dr^ -b r^(d^^ 4- sin^(0)d<^^)) 
which represents space. 

In our case, we may imagine the non-rotating star to be the sun with 
radius fgun > 2 m. It is located in the centre r = 0 of the coordinate 
system but the Schwarzschild solution is of course only valid for r > 
rsun > 2 m.^ It follows that the region r < 2 m can be excluded from 
our discussion and we can utilise the spacetime split introduced above. 
It is natural to identify this spacetime split with the infinitesimal splits 
defined by our own world lines. ''Space” has then its intuitive meaning. 
While in general timelike geodesics represent freely falling particles, in 
our case they should be interpreted as planets (or perhaps asteroids and 
satellites). 

Because of Proposition 7.2.2 we can assume that the movement of a 
single planet or light ray is contained in the plane 0 = 7t/2. 

Lemma 7.2.3. Let 7 ( 5 ) = (t(s),r(s),0(s),(/?(s)) be a geodesic. Then we 




Proof, This follows by dividing the equations for dr/ds and d(f/ds in 
Proposition 7.2.2. i 



Corollary 7.2.1. Let 7 ( 5 ) = (t(s),r(s),0(s),(p(s)) be a geodesic. Then 
g(^s) = l/r{s) satisfies 




—7]m 



-f 



Proof, Substituting ^(s) = in Lemma 7,2.3 gives (d^/d<^)^ + 

-h ^^)(1 ~ 2m^) — jl?. Differentiating this equation implies 
the assertion. i 



^ Hence it does not matter that the Schwarzschild metric is not defined at 
the centre r == 0 where the sun is located. 
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Bending of light rays. Since (null) geodesics are influenced by curva- 
ture, according to general relativity, light rays should appear bent near 
regions where gravity is large. In particular a light ray passing the sun 
at a short distance should appear to be slightly bent. The experimental 
verification of this effect was one of the first tests of the theory. 

To describe this effect we need to determine the angle a under which 
a central object appears to an observer in Schwarzschild spacetime. This 
angle can then be compared with the corresponding angle determined 
by the background metric dr^ -f- r^di?^ of space (cf. Fig. 7.2.3) 




Lemma 7.2.4. Let ^ be a lightlike geodesic and 7(0) = ^o)- 

Then a = Z(-5^,7(0)) satisfies rosin(o;) = {\L\/E)y/l - 2m/r, 

Proof. Since 7(5) = dt/dsdt +^ the nu ll condition -(l-2m/r)|d^/d5p + 
= 0 implies I7I |a/1 - 2m/rdt/ds\ = (1 - 2m/ry^^^E. Since 
X dr the angle o: is given by sin(a) = |d(/?ds<9^|/|7(s)| = |L/r|/((l - 
2m/r)-i/2E). I 



Corollary 7.2.2. Let j be a lightlike geodesic with past endpoint 7(0) = 
(^Oj <^o)> -1^7) passes the boundary of a centred star of radius Thc > 
3m, then the angle defined in Fig. 7.2.3 satisfies 



si„K) = 

ro y 1 - 2m/r* 



Proof. Assume that 7 passes the boundary of the star at s = s*. Then 
(dr/ds)s. == 0 since r has a minimum there. Proposition 7.2.2 implies 



r*£^/|L| = \/l — 2m/r, 
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and the assertion follows by inserting this equation into Lemma 7 . 2 , 4 . 



Observe that the angle of the star is larger than one would expect 
in non-relativistic physics where sin(a^) would just be given by r^/rQ. 
This effect has been verified by a British team lead by Arthur Eddington 
(1882-1944) which measured the bending of light rays close to the sun 
during the total eclipse in 1919. They used the limiting behaviour given 
below. 



Proposition 7.2.4. Let 7 be an inextensible null geodesic which does 
not enter the region 2m /r > 1. Then there is a minimal radius ro = 
min{r o 7 ( 5 ) : s G M} along 7 . Furthermore, there are two lines with 
respect to the flat metric = dr^ + r^dl?^ which are asymptotes of'j 
and intersect at an angle A ~ 4m fro + 

Proof, The minimal value ro = r o 7 (^ 0 ) exists since 7 is inextensible 
and r 07 > 2m by assumption. We may choose our spherical coordinates 
0,ip so that the light ray lies in the plane 9 = tt/ 2 and the equation 
(^ 07 ( 50 ) == 0 holds. Proposition 7.2.2 implies that for s — > iioo the coor- 
dinate (p converges to limits (p±. The lines (with respect to which 
pass through the origin under these angles (f± are therefore parallel to 
asymptotes of 7 . The differential equation provided by Corollary 7.2.1 
can be solved exactly and has the solution 




1 

4. 2mo^^ + (^0)"^ ~ 2mo(ro)“^ 



dg 



We are interested in situations where the ratio m/ro is small. Since the 
angle A = 21im^_^o^(^) vanishes when m = 0 we will linearise (p{g) 
with respect to the parameter x — m/ro a*nd then take the limit ^ ^ 0 . 
Differentiating the function mfro^ p gives 



d{mo/ro) Ji/r„ + 2moQ^ + {ro)~^ — 2mo{ro)~^)^^^ 
and therefore 

f dip \ ^ p Q^-rp-^ .r-2 



Hence we have A = 4mo/ro + o(mo/ro). 
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The perihelion precession of Mercury, Mercury moves around the 
sun describing an orbit whlcli m slmUar to aJi ellipse but not closed.^ 
Already in the 19th century one has measured the angle between con- 
secutive local minima of the distance between Mercury and the sun and 
has tried to explain this angle within the Newtonian theory of gravity/ 
While such a “precession” occurs if one takes into account the gravita- 
tional fields caused by the other planets, this does not give a quantitative 
explanation of the measured value. The first outstanding success of gen- 
eral relativity was Einstein^s demonstration that his theory could explain 
this discrepancy/ 

In order to calculate the “missing angle” we have to compare the 
Newtonian solution of the two-body problem (the sun, Mercury) with 
timelike geodesics (Mercury) in the Schwarzschild solution which de- 
scribes the sun. 

In Newtonian gravitation, a particle in the gravitational field of a 
spherically symmetric star moves according to the ordinary diiferential 
equation 



d27(s) ^ m 



(7.2.10) 



Lemma 7.2.5, Let m > 0 and 7: E ^ be a solution of the differ- 
ential equation (7. 2 AO), Then the curve 7 is contained in a plane and 
Equation (7, 2 AO) is equivalent to 



Q 7(g) „ .2 
ds ’ 



d(f^ 



+ qn mjL^. 



(7.2.11) 



where polar coordinates of this plane and L is a constant. 

Proof Equation (7.2.10) implies that j{s) X ^/(s) (“x” being the vector 
cross product in E^) is constant with respect to s. Hence 7 is contained 
in the plane spanned by ^7(0) and 7(0). If (r, (p) are polar coordinates 
of this plane (x^ = rcos(p,x^ — rsin(^), Equation (7.2.10) is equivalent 
to 






m 

r^(s) ’ 



r(s) 



dV(5) 

ds^ 



, ^ dr(s) dy(s) 
^ ds ds 



= 0 . 



The second equation implies that = L is constant. Setting qm = 

1/r, the first equation is therefore equivalent to d?QN ldip^-\-QN = m/L^. 



® This is actuedly true for all planets, but in the case of Mercury the effect is 
especially pronounced. 

^ In order to do so, Astronomers have assumed the existence of a further 
planet. However, this planet has never been seen. 

^ He did this using the equation Ric = SttT before he arrived at his final the- 
ory with Ric — Scal/ 25 ^ = SttT. This was possible since for these calculations 
only the vacuum equation is needed. 
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Equation (7.2.11) is an inhomogeneous linear differential equation with 
constant coefficients. It is easy to see that (in the generic case L ^ Q) 
there exist constants ci, C 2 such that Qn{^) = sin((^) + C 2 cos((^). 

Our polar coordinates are only fixed up to a rotation in the plane. Hence 
we can assume without loss of generality that there is a constant e > 0 
such that ^ 

This solution is periodic. We could now attempt solve the corresponding 
equation in the Schwarzschild solution, and to calculate the difference of 
angle ip (modulo 27 t) between two consecutive minima of the coordinate 
radius as a Taylor polynomial in mo/ro- However, it is difficult to use this 
strategy in practice because it would involve integrals which are quite 
complicated. 

We will therefore employ a different method and obtain approximate 
solutions from approximate differential equations. Observe that Equa- 
tion (7.2.11) is the Newtonian analogue to Corollary 7.2.1 and that both 
equations differ only by the quadratic term mg^ = m/r^ which is very 
small. The idea is now to view the Newtonian solution as an approxima- 
tion to the relativistic equation. Inserting the Newtonian solution into 
the quadratic term gives a third equation 

+ fopprox = + 3m (^(l + ecosM))" 

which is also a linear inhomogeneous differential equation with constant 
coefficients. It appears to be a better approximation than the first differ- 
ential equation since the term 0 has been replaced by the term 3m (^iv)^ 
which should be a better approximation for 3mp^. While this argument 
is only heuristic a real justification appears to be too complicated to be 
worthwhile in our context. 

This third equation gives 

m 3m^ / 6^ 6^ 

^approx ^ + ecos(v?)) + ^ Y cos(2(/j) + e(psm{(f) 

To calculate the angle at the perihelion we have to calculate the minima 
of the function g{cp ) . The equation 

sin(^) + (I sin(2(/j) + sin{ip) + ^pcos{ip)'j . 

gives that ^approx has a perihelion at cpo = 0 — as was to be expected. 
A comparison with indicates that the next perihelion should be at 
ipi = 2n 6 where S is small. Hence we can neglect | sin(2(5) + sin(J) 
with respect to (27 t 4* 6) cos(27t + ^) and obtain 
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0 



me . 



3m^e 

“ 17 " 



(27t + 5) cos(5). 



With tan(5) S and neglecting 5 with respect to 27 t, this equation 
implies 






67rm^ 



which gives a correction to the Newtonian value in very good agreement 
with observation. 



7.3 Quasi-linear hyperbolic systems of equations 
in two independent variables 

In this section we prove a theorem about hyperbolic systems of partial 
differential equation in two independent variables which will be applied 
in Sect l.f. 

The material is very technical and of a different mathematical 
topic than the rest of this book The reader may wish to skip this 
section on first reading. 

For the following theorem we need some notation. If /: we 

call X x (x,f{x)) the 0-jet of f. The canonical 

projection ^ R, (t, g) g is denoted by pr 2 - 

Definition 7.3.1. Let h € x R^,R^) and let A G C^(R^ x 

R^,Lin(R^,R^)). The system of differential equations 

dtf + Ao f{f)dqf = ho f{f) 

is a quasi-linear system of hyperbolic equations in two variables if for 
every 0-jet (i, g, F) G R^ x R^ the linear map A{t^q,F) has k linearly 
independent left eigenvectors. The directions R(9t + where Xi are 
the left eigenvalues of A are called characteristic directions. The (un- 
parameterised) integral curves of the characteristic directions^ are called 
the characteristics of the system of differential equations ( and the given 
solution). 

The aim of this section is to prove the following fundamental existence 
and uniqueness theorem for quasi-linear systems of hyperbolic equations 
in two variables. 

Theorem 7.3.1. Let h G x R^,R^) cb'i^d let A G (7^(R^ x 

R^, Lin(R^, R^)) such that 

dtf + Ao f{f)dqf = ho /(/) 



^ Here we mean integral curves of vector fields which are tangent to the 
characteristic directions 
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is a quasiAinear system of hyperbolic equations in two variables. For 
any function fo € 6], R^) there is an open neighbourhood U of 

{0} X (a, 6) CM? and a unique smooth solution / : W R^ of the system 
of differential equations such that /(0,g) = /o(^) for all q G (a, &). 

The main part of the proof of Theorem 7.3.1 is contained in the following 
lemma. 

Lemma 7.3.1. Leth^X e (7'^(R^xR^,R*) anda < 6 € R. Assume that 
at any poinf (t, q) € R^ at least two of the numbers A^(t, q) are different 
For any function fo G C°°([a, 6],R^) there is an open neighbourhood U 
of {0} X (a, b) C 'M? and a smooth map / : W — > R^ such that 

(i) .f (0, <l) = h{q) for all q G (a, b), 

(ii) dtf + X^of(f))d,f:=^h^of{f). (ie{f...,k}). 

Moreover f the solution is unique. 

Proof We will first transform the system of differential equations into a 
system of integral equations and then employ an iteration technique in 
order to solve the system of integral equations. 

Assume that / is a solution to our system of partial differential equa- 
tions. For (s,p) G R^ and i G {1, . . . , fc} we denote by t 7(s,p)(0 the 
integral curve of the vector field dt + ^j^{f)dq with 7(s^p){^) ~ 

Prom the definition of and 

^ (/' ° + V o f{f)d,) = dtf + V o j\f)d,f 

we obtain the system of integral equations 

C(s,P) =/*(0,7(^p){0))+ /* /i‘°i“(/)°7(s,p)('r)dT, (7.3.12) 

J 0 

7(s,p)(«) = (s.P) + V o/(/) o7*^ pj(r)dr). (7.3.13) 

Conversely, if there are continuous maps f\ (i G {1, . . . , A;}) which 
satisfy this system of integral equations then they also satisfy the system 
of differential equations (ii). This follows since differentiation of Equation 
(7.3.12) implies the differential equation (ii). 

In order to solve the system of integral equations (7.3.12), (7.3.12) 
we will employ an iteration procedure. Let 

fo(s,p) = fo(p), 7«,p),o(*) = (s + t,p), 



and 
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FUi(s,p) - fU.rUp),rnm + f h* O f{Fm) o r^.p),„(r)dr, 

u 

4,P).tn+lW = (s,P) + (t, f of {Fra) ° 4.p),^(r)dr). 

J s 

We will show that these sequences of functions have well defined limits. 
These limiting functions will then solve our system of differential equa- 
tions. We will prove the existence of unique limits by showing that the 
sequences 

m 

FL{s,p) = Flis.p) + Y.{Ffs,p) - F;_i(s,p)), 
P^2{Fls,p),mif)) = PH{Fls,p)fi{t)) 

m 

+ - Pr2(C/,,p)j_i(i)) 

can be majorised by an absolutely converging series which in turn implies 
that they converge absolutely. 

To achieve this it is important to obtain first bounds on Fj and 
a > a > 0 let 

C'i=sup{||/o(9)ll + ||D/o(g)|l |9€K6]}, 

C -2 = sup {||Mi°(/)(i,9))ll + ||A(y(/)(i,g))|| + ||D/iO-°(/)(t.9))ll 

+ \\D\if{f)it,q))\\ : Hnp\f{t,q)\<2Ci, 

i 

{t,q) 6 [-a, a] x [a,b]}, 

C = Ci + Ca, 



and 



bia,a = q) € (-a, oc) X {a,b)\a + ^ < q < b - 



t L M 

a--<q<b + -}. 



We will solve the initial value problem in the region Ua,oi^ if a is chosen 
small enough. Our bounds imply 



Pr2(^C4^p).mW) 



<C (7.3.14) 



for ^ Uoc. Let (s,p) € Inequality 7.3.14 and the defini- 
tion ofUa,a imply O l^a,a> Hence during the iteration process 

The choice of a is less significant. The only purpose of its introduction is 
to guarantee the existence of the constant C 2 . 
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we do not leave the region Ua,a and our bounds Ci, C 2 are valued at all 
Let 

C=sup| -^F^{s,p) : ie [-a-s,a-s],i € 

<^m = sup| ^pr 2 (r/,,p),^(t)) : ^e[-a-s,a-s],te{l,...,fe}|. 
Assume that sup{|Fj(5,p)l : (s,p) e I4a,a,j 6 {0, . . ■,m}} < 2Ci. Prom 

dP d P / B 

+ Ds/iDaFm ^Pr2(L'(*^,p),m('^)))dr 

and 

+ D3A* D2Fm ^pr2(r(\,p),„,(r))dr 

we get 

C+i < CS^ + + CS^5^), 5^^, < 1 + a(CS^ + C5i5^). 

Let a < fqj^- Then these inequalities imply < 3(7, = 2 

if ^ and 6^ = 2. Since Sq < C and = 1 these bounds 
are valid for all m. We estimate for sup{|Pj(s,p)| : (s,p) e Ua,a,j € 
{0, . . . , m}} < 2Ci 

\F"m+l{s,p) - F^{s,p)\ 

<l/^(4.P).m(0))-/^(4.p).^-l(0))| 

+ flho f{Fm) O - h O O r^,p),„,^i(r)|dr 

J 0 

+ J\np { max{||(Dft)(j'>(P„,) o 
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t e [0, 5]}||/(F^) o 4,p),^(r) - o r^,p),^_i(r)|ldr 

+ C r (|pr2(r^,,),^(r)) - pr2(r^,,).^_i(r))| 

k 

+ ^ |Jl(s,p) O - F^_i(s,p) o 

j=i 

+ C (lpr2(r(',,p),,nW) - pr2(r(^.,p),„^-l W)l 

j=l 

+ l-C-llS-P) ° ^Up),m - ^L-l{s^P) ° r(s,p),m-l\))^'^ 
<c■l4,p),m(o))-r^,p),„^-l(o))| 

+ cj^ ((1 + 3fcC)|pr2(r^,_^),^(r)) - pr2(r^,.,),^_i(r))| 

k 

m ^m— 1 («>p)°r^s,p),ml)dr, 

j=i 

where in the last inequality we have used 

l^m-l(s.P)°-^(.,p),m - ■P’m-l(«.P) ° ^t,p),m-l\ 

< sup ||D 2 F^_i(s,p)|| \rl^p),m - ^(s,p),m-l\ 

<^C\ri^p),rn-ri,p),rn-l\- 

Analogously we obtain the estimate 

(s,?>),m+l ~ ^{s,p),m\ 

<cf ((1 + 3feC)|pr2(r4_^)_^(r)) - pr2(r^,,,),„_i(r))| 

J 0 
k 

+ ^ \Fi{s,p) O - F4_i(s,p) o 4.rt,^|)dr. 

i-1 

Let 

«m = sup||F4(s,p) -i4-i(s,p)| |(5,p) € Wa,a,ie 

Cm = sup {|pr 2 (r^^,p),m(*)) “ P^ 2 {^Up),m-li^))\ |(s,P) € U&,a, 
t € [—a — s,q; — s],i € {1, . . . , fc}|. 
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Prom our estimates we obtain 

em+i < {l+a{l+ZkC))Ce^+akCe^, e^+i < aC{l+3kC)e^+aCke^, 

We will now show that these inequalities imply 

^ {2kC^/a)'^ and < {2kCy/a)”^^/a 

if a is chosen small enough and 

sup{|F](s,p)l : (s,p) € Ua,a,j e {0, . . . ,m}} < 2C\ 

holds. In fact, we have ef < aC and ef < aC, so the inequalities hold 
for m = 1 if a < l/(2fc)^ < 1. If they hold for we get 

4+1 < (1 + Oi{\ + ZkC))C{2kC^TV^ + akC{2kCV^r 

= (1 + a(l + 3kC) + 

— cxC(l + 3fcC)(2fc<7\/a)’^\/S + OiCk{2kCy/a)^ 

= (a(l + 3kC) + fc^/Z^)i(2fcC^/5)™+^ 

Hence it is sufficient to choose a < min{l/(2fc)^, (2fc-l)^/(l+3fcC+fc)^}. 
Since \F^{s,p) - 



the sequences 

m 

F^{s,p) = F^{s,p) + ^(Fj(s,p) - F]_i(s,p)), 
i=i 

m 

Pr2(-C(s,p),m(*)) = Pr2(-^(s,p),o(*)) +Xl(P'^2(^(s,p),i(^)) 

j = l 

- Pr2(4,P)j-i(^)) 

converge if the series 4 and 4 

jorised by a convergent geometrical series if a < lf{2kC)^. Moreover, it 
is easy to check that 

m 

sup{li^j(s,p)| : (s,p) €Z^ 5 ,a,j € {0, ...,m}} < Cl + < 2Ci 

i=i 



if 



jCif 

4fe2C2(l + Ci)2 
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holds for every m. This proves imiform convergence of F^, Ff \ ^ if a 
is chosen small enough. 

This solution is clearly differentiable with respect to t and continuous 
with respect to q. In order to show that it is O’" with respect to q and t we 
can apply the same to argument to the prolonged system of differential 
equations which is obtained by differentiation of (ii) as follows. For I 
(Ze{l,...,r-1}) differentiate equations (ii) I times with respect to t 

and replace {dtYf (I € by gl'^ and {dtYf by This 

gives k{r — 1) equations of the form 

dtg'/ + ydggf - gl '\ '*) 

which we add to our system of differential equations. Differentiating 
equations (ii) I times with respect to q we obtain equations of the form 

^tidqYr + ^^'dqidqYP = 9qf , (dqYf)- We therefore also add 

the kr equations 



dtgi'^ + \%gi’^ - GYif,gl'\ 



>9q’‘) 



to our system of differential equations. Choosing the additional initial 
conditions 



(gt’‘)o(g) - (- ydqgl’‘ ^ + Cl’\f,gy, . . ■,gt’‘))(jg, 

(gt’\(g) = -><\0,q, fo(g)dqfo + h\0,q, fo(g)), 
igfUq) = {dqYhiq) 

we obtain a system of differential equations which is of the same form 
as (i), (ii). By construction, the first components /® of the solution of 
this system coincide with the solution of our original system. Moreover, 
g^/ = {9qYP> g\'^ = {9tYr continuous for all I which implies /* 6 
C'"(t^a,a,K). 

We will now prove uniqueness of the solution. Assume that {f, ^^j), 

(/> 7 (s, 3 ,)) both satisfy the system of integral equations (7.3.12), (7.3.13). 
Then we get 

= /^*‘(7fc,)W.r(7k,))W) -*‘(%,,)M./‘(7;.,,,(r))))dr 

<of ‘ (|7;.,rt(T) - ^;,,„(r)| + |/‘(7,‘.,,))(r)) - /‘(7;,.„ W)|)dr 

J 0 



and, analogously, 
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Jo 

Hence writing d(s, j/) = \lls,y)-%,y)\ + \Pi'rls,y)) ~ we obtain 



max{ci(T,j/)} < max{2C / d{T,y)dT} 

reOjS r€0,s Jq 

< max{2C5 max{<i(r, y)| 

r€0,s t€0,5 

This equation can only hold if d(s,y) ~ 0 for s < 1/(2C). 



< max{2C's max{oi(T, 2 /)|}} = 2Cs max{d{T,y)\}. 

r€0,s t€0,5 t€0,5 



Proof of Theorem 7.3.1. Let {h{t, q,F),..., lk{t, q, F)} be a basis of left 
eigenvectors of A{t, q, F) and denote their eigenvalues by X^{t, q,F), ..., 
X^{t,q,F). Multiplying the differential equation from the left by T we 
obtain Tdtf + XH^dqf = l^h or, equivalently, 

dtiVf) + ^%iU) = f + Kf 

= {Th) + {Dik + Dzkdtf)f + Xi{D2k + D3hdqf)f 
= {Th) + {Dik)f + Xi{D2k)f + {D3kh)f. 

This system of differential equations is of the diagonal form treated in 
the Lemma 7.3.1 above. Hence the assertion follows directly from this 
lemma. ® 



Corolleury 7.3.1. In addition to the assumptions of Theorem 7.3.1 let 
[a,/3] C [a, 6] and to > 0. The solution f of the quasi-linear system of 
hyperbolic equations at (to,qo) € IK^ depends only on the initial data 
restricted to [a, ^3] if and only if all characteristics intersect {0} x R in 
the subset {0} x [a, 0\ . 

Proof This follows immediately from the integral representation (7.3.12), 
(7.3.13). ■ 



The following corollary will only be used in Sect 9.5.1. 

Corollary 7.3.2. In addition to the assumptions of Theorem 7.3.1 as- 
sume that A is constant and that the map h is linear in f. 

The characteristic directions do not depend on the solution. More- 
over, lei 2)([a, 6]) be the set of all (t, g) € R x M such that all character- 
istics through {t,q) intersect the set {0} x [a, 6]. Then there is a unique 
solution f which is defined on all o/S“^([a, 6]) 
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Proof. It is clear that the characteristics do not depend on the solution 
since A does not depend on it. We denote the characteristic through (s, p) 
to the ith eigenvalue A* by 7(s,p)- The non-trivial part of the corollary is 
to prove that the solution which (by Theorem 7.3.1 may only be defined 
locally) extends to all of 2)‘*'([a, 6]). This can be achieved by applying a 
slightly refined version of Theorem 7.3.1 repeatedly. To this end we will 
need better estimates in the proof of Theorem 7.3.1. 

Assume that the initial data satisfy |/^(g)| < C\ for all q € [a, 6]. 
Since h is linear with respect to / there are functions h\ such that 
q)) = h;(t, q)f{t, q). It follows that there is a constant 

K eR such that 

k 

K > sup{J2mt,q)\ + |Ai + \\mi{t,q)\\) : (t,g)2)([a,6])}. 

i,l=l 

Observe that K is well defined since S([o, b]) is compact. Unlike in the 
proof of Theorem 7.3.1 the inequality defining K is independent of the 
solution. We set Ua — {(t, q) € S([a, 6]) : -a < t < a}. 

Recall that — 7(«,p) for all m. The estimate for \Fm+i{s,p) — 

F^(s,p)| in the proof of Theorem 7.3.1 simplifies to 

=0 

\FUAs,p) - F^{s,p)\ < |/^(7(.,p)(0)) - ro{lts,p){0))\ 

■^0 1=1 
< akKe^, 



where 

e^ = sup{|F^(s,p)-F^_i(s,p)| : (s,p) & T){[a,b]),l € fe}}. 



This estimate impUes the recursive inequality < akKe^. Prom 

ef < akKCi we get < Ci{akK)”^ and therefore Yl'^=i 5 i-akK ' 
Using F^{s,p) = Fo*(s,p) + J2^=i(Fj{s,p) - F^_i{s,p)) we obtain the 
bound 



iF4(s,p)i<ci+;^€f < 

j^l 



Cl 



1 — akK 



It follows that for a < l/{2kK) there is a solution defined in all of 
Wq,. Since the number l/{2kK) is independent of the solution we obtain 
our global solution by successively solving 2kK max{s : 3p with (s^p) € 
D([a, 6])} initial value problems. The proof of uniqueness and differen- 
tiability is exactly as in the proof of Theorem 7.3.1. I 
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Remark 7,3.1, Observe that Corollary 7,3,2 is still correct if we replace 
the initial curve {0} x [a, b] by an arbitrary curve C which is intersected 
by each characteristic at most once and replace 2)([a,6]) by set 25(C) of 
all (t, g) 6 M X R such that all characteristics through (t, q) intersect the 
curve C. 



7.4 The initial value problem 

for spherically symmetric perfect fluid spacetimes 
with non-interacting electromagnetic fields 

In this section we discuss the initial value problem (cf. Equations 
(7.1.3) -(7.1. 6) J in some generality for a spherically symmetric space- 
time which represents a perfect fluid. 

Since the section is quite technical and requires the results of 
Sect, 7,3 the reader may wish to skip it on first reading. 

The Schwarzschild solution is a good description for the exterior of an 
isolated, spherically symmetric, non-rotating star. Here we wish to solve 
Einstein^s equation for the interior of such a star. The complete model of 
an isolated, spherically symmetric, non-rotating star is then usually ob- 
tained by matching the interior and the exterior solutions at the bound- 
ary of the star. This will be done in Sect. 7.5 for the special case of static 
stars. 

The system of Equations (7.1.3)-(7.1.6) is highly non-linear and ra- 
ther complicated. Observe that the assumption of a perfect fluid (prad = 
Psph) allows us to integrate Equations (7.1.7) and (7.1.8) directly sub- 
stantially simplifies the problem. This simplification is unaffected when 
we include a non-interacting electromagnetic field. 

In the following we will first study electromagnetic fields in spheri- 
cally symmetric spacetimes and then discuss the initial value problem 
for a spherically symmetric spacetime which admits a perfect fluid and 
a non-interacting electromagnetic field. 

Readers who have not read Sect 5,2,3 on Maxwell's equation may 
wish to skip the material up to Lemma 7.4,2 and assume Tei = 0, 
e = 6 = 0 in the following discussion. 

Recall from Sect. 5.2.3 that the source free Maxwell equations for an 
electromagnetic field are given by 



dF = 0, 
div(F) = 0, 

where F is a 2-form. The electromagnetic part of the energy momen- 
tum tensor reads then (Tei)a6 = ■ii^ig'^FacFbd - i (F,F) gab)- Given a 
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spherically symmetric spacetime {M,g) and any 2-form F satisfying the 
source-free Maxwell equations one could define Tmatter = T ~ T^i. Of 
course, the matter represented by Tmatter would in general be quite ex- 
otic. Moreover, it is possible that neither Tmatter nor Tei are spherically 
symmetric. This discussion indicates that we should impose additional 
conditions in order to describe physical electromagnetic fields. Given that 
(M, g) is spherically symmetric, the most natural additional assumption 
on the energy momentum tensor Tei would be to demand that it is invari- 
ant under rotational isometries and that F is well defined in sufficiently 
large open sets containing complete spheres of symmetry. 

Lemma 7.4.1. Let (S x S^,g^ + r^dl?^) be a spherically symmetric, 
4-dimensional Lorentz manifold, ps be the volume form of{S,gs), and 
Ps^ the volume form of (5^, df?^). If F € x 5^) satisfies 

dF = 0, div(T) = 0, 

and (Tei)afe = -^{g'^'^FacFbd-^ (F,F) gab) is spherically symmetric, then 
there eodst constants e, b with 

F = PE + b{ns^)*ps^- 

The corresponding energy momentum tensor is given by 

s ^ ® ® <3‘) + 

Proof. We consider the orthonormal frame {U,Q,E2,Ez}, where 

E2 = -de and E3 = — 

r rsm{9) 

Spherical symmetry of Tq\ implies 

T,i{E 2,E2) =Te^{Es,E3) 



Te,{f/, E 2 ) = Tel(17, Ez) = Tel(Q, E 2 ) = Tel(Q, E3) = Tel(^2, E 3 ) = 0. 

All other components of Tei are unconstrained (cf. Lemma 7.1.1). 
Since 47r(Tei)a6 = 9'^‘^FacFbd — 3 {F, F) gab, we get 

0 = 47rTei(£^2, E 2 ) — ^'JtTe,\{Ez, Ez) 

= -F{E 2 , uf -f F{E 2 , Qf + F{E 2 , Ezf 
- i-F{Ez, C/)2 + F{Ez, Q)^ + F{Ez, Ez)^) 

= -F{E 2 , U)^ + F{E 2 ,Q)^ + F{Ez, 17)2 _ g)2 
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and 



0 = 47 rTei(^ 2 , £^3) = ~F{E2, U)F{E^, U) + F{E2, Q)F{E^, Q), 

Multiplying the first equation with F{Es^ f 7 )^ and inserting the second 
equation we obtain 

0 = -F(E 2 , QfF{E^, Qf + F{E 2 , QfF{Ez, Uf 
+ {F{E2, Uf - F{E3,Qf)F{E3, Uf 
= {F{E3,Uf - F{E3,Qf){F{Ez,Uf + F{E2,Qf). 

If F{Ez,Uf - F(E3,Qf 7^ 0 we have F{E^,U) = F{E2,Q) = 0. In- 
serting this into Te\{E2,E2) - Tf,i{E3, E3) = 0 gives then -F{E2, Uf - 
F{E3, Qf = 0 which in turn implies F{E2, U) = F{E3,Q) = 0 . In par- 
ticular, we have shown F{Es^ U) = F{Ez^ Q) = 0 which contradicts the 
assumption F{E^, 11)^ - F{Ez^ ^ 0. 

If F{Es,U)^ - F{E^,QY = 0 we get F{Es,U) = r}F{Es,Q), where 
Tj e {” 1 , 1 }. In the first case the equation Tei(£?2j£^3) = 0 implies that 
F(E2, U) = rjF{E2, Q) or F(Es, U) = F{Es, Q) = 0. In the second case 
we obtain the same conclusion from the equation 0 — 47rrei(E2,£^2) — 
47 rTel(-E 3 ,£^ 3 )- 

The equations Tei(Q, £^2) = T^l{Q, E3) = 0 imply now 

0 = ^f{q,u){t]F{E2,q)) + (-f(E3,q))f(£;2,£;3), 

- r?F([7, Q)F(F 2 , Q) - F(F 2 , Es)F{Es, Q) 

0 = ”F(Q,C/)(r7F(F3,g)) + (~F(F2,Q))F(F3,F2) 

= 7]F{U, Q)F{Es, Q) + F(F2, F3)F(F2, Q) 

This is a linear system of equations for F(F2,Q),F(F3,Q). Since the 
determinant of the associated matrix is F(C/, Q)^ + F(F2, Fs)^ we have 
either F(F2,Q) = F{Es,Q) - 0 or F([/,Q) - F(F2,F3)^= We 
have therefore two possible cases. There are functions A^B: M — ^ M 
such that either F = Ar{U^ + vQ^) A d 0 + Br sm{6){U^ + vQ^) A d^ or 
F - A + Br^ sm{d)de A dUp, 

In the first case let a: 6 F x 5 ^. Then 

F{'KEix),y)i-E' + ??Q,*) = 2r{x){A{'KE{x),y)de + B{7rEix),y)sin{e)d(f) 

defines a 1-form on the sphere of symmetry Sx {T^x^ix)} x 5^. Since 
5 ^ does not admit any non- vanishing vector fields, this 1-form must 
vanish at some point yo of 5 ^. This implies £(7rs(a:),yo) ~ ^ there- 
fore (£ei)(7T2:(a;),z/o) ~ Since Tel is spherically symmetric we obtain 

The equations Tei(F, F 2 ) = Tei(F, F 3 ) = 0 do not give any more informa- 
tion. 
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(^ei)(irj;(a:),j;) = 0 for all {■Ks{x),y) £ Sx- By arbitrariness of x we finally 
have Tel = 0. 

In the second case the electromagnetic field is given by F = AU^ A 
Qb + Br^ sin(0)d6' A dy>. Since d(17'’ A Q*") = 0 and d(sin(^)d0 A d^?) = 0 
we obtain 

dF = deAde AU^ A + d^Adap A C/'’ A 

+ dt{r^B) sinMt A d^ A d</? + dq{r^B) sin 6dq AdO A dyi. 

It follows that dF = 0 is satisfied if and only if A depends only on t and 
g whereas r'^B depends only on 9 and p. Since div(F) = 17* A + 2F«lnr, 
div((5) = <3 • t/ + 2Q • Inr, and [F, Q] = {Q * u)U - {U • \)Q we get 

div(F“) = div(-i(17 ® Q - Q ® F) + ®dq,-dq,® de) 

H Sin (c') 

= -di(F)g + di(Q)F - A{div{U)Q - div{Q)U + [U, Q]) 

^ r2 ski(g) 

= A((^ •In A + Q • V + 2Q • In r — g • z/) F 
+ i( - F • Ini - F • A - 2F • Inr + F • A)g 

J_ f B ^ ^ cos(0) B i Pi f B 

^ yr^sin(0) j sin(0) sin(0) yr^sin(^) j 

Hence the equation div(F) = 0 is equivalent to 

F • In(ir^) =0, g • ln(ir2) = 0, dq,B = 0, = 0. (7.4.15) 

It follows that there are constants e, 6 e M with F = A + 

6sin(0)d0 A dp. We calculate (F, F) = — ^ + ^ and get 

2 7 2 

47 rT = ^(F*- ® F^ - g^ ® g^) + ^(dO^ + sin^ Odp^) 

1 / 2e^ 2/)^\ 

- J ( — ^ + -jx J ® F*" + g*" ® g'’ + r^(d^^ + sin^Fdyz^) 

t) ®u^ + Q'® Q') 

(t t) 
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Observe that assuming dF ~ A where A is some spherically independent 
1-form would have lead to 6 = 0 since there is no non-vanishing vector 
field tangent to S^. 

Lemma 7.4,2. Let T = eU'’ ® + PradQ^ ® <5'’ + Psphr^it,q){de^ + 

6 (Lp^) + Tei. Then Einstein's equations are equivalent to 



[7 . 17 • r = -(Q • r) ^ ^ 

e+Prad r e + Prad 



I \ 

- 47rr ( Prad - ^ 1 + 



e^ + 62 



27rr^ ’ 

,, ^ ,rr ^<9•Prad „ (t/ • r)(Q • r) Prad ~ Psph 

u • Q mr = -{U •rj — ; 2 ; , 

e + Prad r e + Prad 

, ,Q»U •r .U »r 

u »e = -(e+prad) 2(e + Psph)- 



Q »r 

Q »m = iirr^iQ • r)(e + ^ + 

07T 



+ b\ 



87rr^ 



(7.4.16) 

(7.4.17) 

(7.4.18) 

(7.4.19) 



where m = §(1 + {U • r)^ — (Q • r)^)- 

As a consequence, of Einstein’s equation the equations of motion, 

U»r 



U •e = -{e-\-ptad)U *A-2(e+psph)- 

f 

Q • Prad = ”(c “b Prad)Q • U — 2(prad “ Psph) 



Q^r 



hold. 

Proof. The energy momentum tensor is given by 






t e^Ab\ 

+ (Psph + ) 



87rr^ 

^ \ ^2 _ j _ Q ^^ 2 ^ 






Writing 

^ H- 6^ „ -f 

We can apply Lemma 7.1.2 with e,pradjPsph replaced by e,Prad,l?sph' 
Observe that the equations of motions (7.1.7), (7.1.8) are a consequence 
of they system of differential equations (7,1.3)-(7.1.6). Equation (7.1.6) 
is the only equation in the system (7.1.3)-(7.1.6) which involves the 
function Psph- Since this equation as well as Equation (7.1.7) can be 
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solved for psph, we can replace Equation (7.1.6) by Equation (7.1.7) in our 
system. Using Equation (7. LB) (wHrh m fsow a consequence of Equations 
(7.L3)-(7.L5)j (7.1.7)), the definition of m, and the commutator relation 
[i7, Q] = (Q • u)U ~ {U • X)Q imply 

U»U*r = -(Q • J-) C * ~ PsphKQ » r)/r _ m 

e + Pr~ad 

- 47rr(ftad - 

= 2 (C * Pr'ad - Psph _ m 

e + Pr~ad r € + Prad 

- 47rr(pr'ad - ^), 

[/•g,r = Q*C/*r + ((5*t/)(t/*r)-(?7.A)(Q*r) 

= _({/ , * Pr~ad + 2(pr~ad ~ Ps~ph)(Q • r)/r 

€ “1" Prad 

= (C » r) ^ * Prad ^ (U»r)(Q» r) Pr~ad - Psph 

e + Prad r e + Prad 

It follows from the calculation in the proof of Lemma 7.1.3 that we can 
replace Equation (7.1.3) by Q • m = 47rr^(Q • r) e. 

Equations (7.4.16)-(7.4.19) follow now from 



^ H" Prad — ^ L Prad j 
Q*Prad =Q»Prad ~Q 

= Q«Prad+4-g^(Q.r) 

Prad Psph — Prad Psph 2 i 

/} ~ /I + 6^ 

47rrprad = 47rrprad - 

Finally observe that as a consequence of these equations the Equations 
(7.1.7) and (7.1.8) hold unchanged, even if 4- 6^ 7 ^ 0. I 

The system of differential Equations (7.4.16)-(7.4.19) is singular at r == 
0. It can be shown that in the case e ~ b — 0 this singularity is only a co- 
ordinate singularity, provided, the initial data can be smoothly extended 
to r < 0 as symmetric functions. A proof, however, would require the so- 
lution of a mixed initial- value-boundary-value problem (cf. (Courant and 
Hilbert 1962; Muller zum Hagen, Yodzis, and Seifert 1974)). If e^-h6^ 7 ^ 0 
then there is physical singularity at r = 0. This follows immediately from 
the fact that the invariant function p[ 2 ](T, T) blows up as r — > 0. In the 
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following theorem we will solve the initial value problem in a region 
which does not contain the centre of symmetry r = 0. 

The initial value problem is not yet “well posed” . In order to trans- 
form the system of differential equations into a quasi-linear hyperbolic 
system of equations we write y = U •r and T = Q • r . Adding these two 
equations two our system of equations we obtain 



and 



U •r=:^y, 

jj ^ 2^ Prad ~ Psph ^ 

^ Prad ^ ^ d" Prad 



47rr Prad 



[7 * ^ * Prad __ 2 Prad Psph 

Prad ^ C H“ Prad 

U • e~ —(e Prad) ^ ^ — 2(c + Psph) ^5 



(7.4.20) 

(7.4.21) 

(7.4.22) 

(7.4.23) 



(7.4.24) 

e.m = 4,r»r(e+^ + !^). (T.4.25) 

The first 4 equations constitute a quasi-linear hyperbolic system of equa- 
tions for r^y^r^e. It will turn out below that last 2 equations will hold 
everywhere if they hold initially. This leaves us with two undetermined 
functions, Prad> and Psph- In order to arrive at a well posed system of 
equations we could either augment the system with two more differential 
equations which relate the pressures Prad^Psph to our remaining quanti- 
ties or we could impose functional relationships. We will opt for the 
latter possibility and assume prad = Psph since in this case we can solve 
Equations (7.1.7), (7.1.8) explicitly which greatly simplifies the problem. 
Furthermore, we will assume a functional relationship p^ad = p(e) which 
describes the physical properties of our fluid. This equation is referred 
to as an equation of state. 

Theorem 7.4.1. Let M \ {0} x coordinised by functions 

4 G M; and let p: M M. be a given monotonically increasing 
smooth function. 

For any e, 6 € M and any smooth functions r : R, e : R"^ — > R*^ 

such that e(q) + p o e{q) > 0 for all q e there is a neighbourhood U 
of the hypersurface {7q} x R"^ x 5^ and a unique Lorentz metric g on U 
such that 

(i) g satisfies the spherically symmetric Einstein equation with cos- 
mological constant A for a perfect fluid with equation of state p(e), 
a source- free electromagnetic field with parameters e, b; 
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(ii) €|t=to = e; 

(Hi) r\t=to = r- 

Proof. It is sufficient to prove that for all ri > 0,r2 > n there is a 
neighbourhood Wn,r 2 of {^ 0 } x x a unique solution g 

defined on Z4-i,r2* 

In a first step we will show that — up to reparameterisation A 
h{q)X, ly — the initial value problem has a unique solution 

such that (r, e) coincides with (r , e) at t = to . To this end we introduce 
two new dependent variables F — Q • r and y = U • r and augment 
the system of equations by the definition of y. This gives the system 
of Equations (7.4.20)-(7.4.25). The additional initial values F and y for 
F, y are calculated from the necessary initial, “constraint equations” 



f 

2 




2 r 



F^Q^f, 



In the first step we will show that there is a unique solution to the initial 
value problem (7.4.20)-(7.4.23). In the second step we will show that 
this solution also satisfies Equations (7.4. 24), (7.4.25). 

Equations ordinary differential equations (7.1.7)), (7.1.8) are conse- 
quences of Equations (7.4.20)-(7.4.23) and can be solved independently. 
In order to simplify the formulas, we define the “baryon number density” 
n by 

de _ e+p{e) 
dn n 

and the asymptotic behaviour t{n)/n 1 (e 0). It follows that 
fo 17^1) “ which in turn implies 



Ume 
e -1- p(e) 



U ♦ ln(n) 



and 
Q •In 



/ e + p(e) \ _ n f e+p{e) \ 

\ n ) e + p{er \ n J 



e + p(e) 

n ( Q*e + f^Q»e _ (e +p(e))^Q » e ' 
€+p{e) 

Q^e 



n 






1 4 . ^ _ A = 

de J e + p(e) 



e+p(e) 

Since Equations (7.1.7), (7.1.8) are equivalent to 



U • ln(n) — 



U*e 
e + p 



= -U»\-U» In(r^), 
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<5 • In 




Q*P 

e + p 



-Q*v, 



we get 

e-* = M5)Aande- = ^. 

where ft,, H are constants of integration. The system of equations (7.4.20)- 
(7.4.23) is equivalent to 












/o 


0 


0 


0 \ 


fr'\ 


0 


0 


0 


rA 


y' 


0 


0 


0 


yA 


r' 


\0 


B 


0 


0/ 


V) 



Hny 

e+p 



Hn 

e+p 



- 47rr(p - 

0 



>) 



V 



-2Hnl 

r 



where a := 9ta,a' := dqU^ and 






B = hH 



(nr)^ 



The matrix 



/O 0 0 0 \ 

0 0 0 

0 0 0 yA 

\0 B 0 0 / 

has left eigenvalues ai and left eigenvectors k given by 



ai/2 = 0, Q!3/4 = ±VABF^ 

h = (1, 0, 0, 0), Zs = (0, y, -r, 0), Za/4 = (0, 1, 0, 



B 



The left eigenvectors h.hJs^ and U are linearly independent unless F = 
0, € + p{e) =: 0 or ^ = 0. Hence our system of differential equations 
is hyperbolic and admits a unique, local solution in Un,r 2 Theorem 
7.3.1). By the uniqueness of the solution we can choose a collection of 
such intervals (ri,r 2 ) which cover all of R+ and patch the solutions with 
respect to these intervals together. This gives a unique solution in a 
neighbourhood of the entire hypersurface {to} x R+ x 5^. 

We will now show that this solution also satisfies Equations (7.4.24) 
and (7.4.25) everywhere. We have chosen our initial data T, y so that 
they hold at t — to- From 
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U*{r-Q*r) = U»r-Q»y-[U,Q]»r 



C/ # r> i _ jy 

-y~ j (/ • g + 2 — - — (Q » u)y 

e+p e+p r 

(c/ • A)(g • r) + (c/ • A)r - {u . A)r 






+ C/.A + 2^ 
€ + p r 



• \{Q •r - r) = U • \{Q »r - r) 

we obtain a linear differential equation for the function {F — Q^r). Since 
r—Q»r = 0 initially this equation therefore holds everywhere. The other 
constraint equation (7.4.25) is slightly more comphcated. Observe first 
that Equations (7.1.7), (7.1.8) imply [U,Q] = q. 

The proof of Lemma 7.1.3 shows that the equation U •m ~ 47rr^r(e + 
is a consequence of Equations (7.4.20)-(7.4.23). We have 



U • Q^ { 47t 



€ + — dr 



1 + 6^ 



= • 47T ( € + 






H (U • m) 

e + p 

= SnryF (c + ^ 



U • e 2y\ . _ . 

—— + — ) (g • m) 
£ +p r 






(f/.r) 



*f 47rr^(f/ • c)r 4- 47 tQ • r^y p - t 



A + 6 ^ 



Stt 87rr^ 



47rr^^ p - 



Stt STrr^ 



-{Q • m) 



{ yl \ 

8^ryr[e + -) ^yF 
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\ 



-f 47rr^ 



-f SurFy 






/ 



/ ? 



£ \ 



A + 6^ 



P--^- 



Stt Snr^ 



2 , _ , / A 



+47rr^2/(Q • P) +2yr 



e"' -f 6^ 



^ ^ ^ 47rr^y ( 



e+p V 



A e^ + 6^\\ ,Q*y 



Stt STrr^ 



)) 






8^rry(6+A_(e + p)+p-A) 



+(Q*2/)(-47rr 



,e+p 



+4"- (e • ») (p ^ S - -^3") + — ) 

4 



+tor^s(« • P) ( - (« + ^ + ^8^) (;^) 



+ 1 



e"' + 6^ \ \ + 6^ 



1 / ^ + 6^ \ \ 

e + p \ Stt 87rr^ / / 



47rr^ e + — + 



g2 _|_ ^2 



Stt Sirr^ 



+ -^yr{-l-l^2) 

-\-Q •m 



We get a linear differential equation for 47rr^r(€ + ^ + ~ Q • ^ 

and 47 rr^r(e + ^ + = <5 • m holds everywhere since it is initially 

satisfied. I 
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Observe that the monotonicity condition on p is necessary to obtain 
a hyperbolic system of differential equations. The function can be 

physically interpreted as the velocity of sound. This indicates that ^ > 0 
is physically well justified assumption; 



Corollary 7.4.1. The characteristic directions of the Einstein equation 
are 



t/, z = 17 -h 




- 




It follows that for ^ > 1 



inforwMion can travel faster than light 



Proof ABP = h^H^(nr)^/{e+p)^^ = implies dt±y/A^dg = 

U ± - The second assertion follows from Corollary 7.3.1. I 




Remark 7.4 A, The system of differential equations is especially simple 
in the case of dust: p(e) = 0. Then it reduces to the following system of 
ordinary differential equations. 



U mr 






1 + 

2r 



+ 



rA 



where P does not depend on t. The equation for the energy density e 
decouples and can be calculated from U • e — ~ 2e^. 



7.5 Static perfect fluid stars 

Most stars do not change very much over long time spans. It appears 
therefore reasonable to assume that their interior can be described by 
static, spherically symmetric solution to Einstein’s equation. Static solu- 
tions should be an even better description once these stars have burned 
all of their fuel. In this section we will show that the assumption of static- 
ity has a striking consequence: there is an absolute upper bound for the 
mass of a static, spherically symmetric star. Further, this bound is so 
small that it is exceeded by a multitude of known stars, which indicates 
that many of these stars will collapse into singularities once their fuel is 
exhausted. 

In this section we will model a non-rotating star by a spherically sym- 
metric, static perfect fluid spacetime. Under the assumption of staticity, 
Einstein’s equation for a perfect fluid reduces to the following ordinary 
differential equation. 
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Theorem 7*5*1. Let (M^g) be a spherically symmetric 4-dimensional 
spacetime which is and piecewise smooth, and assume that there exists 
a timelike Killing vector field U such that the energy momentum tensor 
is given byT = {e-^p)U^ +p g , where e, p are given, smooth function. 

If the solution has a well defined centre of symmetry r = 0 then p 
satisfies the Tolman--Oppenheimer--Volkoff equation 

^ . m{r) + inr^jp - £) 

dr P ^ _ 2m{r)) ’ 



where m{r) = 47t (e(r) 4- df. 

Conversely, let e: p: given continuous func~ 

tions such that 

(i) t and p can be extended to R" as smooth, even functions, 

(a) e and p vanish for r > ro; 

(Hi) e{r) + p{r) > 0 for r <ro, 

(iv) e and p are smooth for r <ro, 

(v) e andp satisfy the Tolman-Oppenheimer-Volkoff equation, where 



m(r) = 47 t 





df. 



Then there exists a unique 4- dimensional, spherically symmetric Lorentz- 
ian -manifold {M,g) which is piecewise smooth and satisfies 

(a) Ric - |Scal^ = 8iv{eor ~\-por)U^ ® + {por)g, 

(b) U is a timelike Killing vector field. 

There is an mo > 0 such that for r > ro this spacetime is isometric to 
a spherically symmetric vacuum spacetime with cosmological constant A 

7 

and mass mo + ■ 

Proof. Equations (7.1.8) and (7.1.5) imply 



(Q»r) 



Q*P 



e + p 

Since by definition of m, Q • r = 



(“+4„(p-l)), 

~ the validity of the Tolman- 



Oppenheimer-Volkoff equation follows from ^ 

For the converse we have to check that the Tolman-Oppenheimer- 
Volkojff equation implies that there exist functions u{r),X{r) which sat- 
isfy Equations (7.1.3)-(7.1.6). 

Observe first that Equation (7.1.4) is trivially satisfied. Equation 
(7.1.3) is equivalent to 



r dr r 



2 
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which implies m(r) = |(1 - (Q®r)^) ~\~A{t) — 
A(t) is a const sant of Integration. Froiri 
therefore 




1(1 -e + where 
™ 0 we get A{t) =0 and 

(7.5.26) 



The 4-dimensional solution should be smooth at r = 0. Since a ro- 
tation by 7T is an isometry it is clear that the metric must be invariant 
under the transformation r i-^ ~r. But this is equivalent to A and u 
being even functions of r. We also need that A satisfies A(0) = 0 because 
for any other value we would get a conical singularity. In fact, consider 
a centred sphere with area A{r) and (geodesic) radius R{r), In the limit 
r — > 0 we obtain 



lini Atl = liia l![!f 

n^oR^ir) r^o(/;(Q.r)-idf)2 



47t 2 

3 2(C.r)-2|^^o 



47T 

T 



e-2A(0) 



which reduces to the Euclidean relation in the tangent space at the centre 
of symmetry if and only if A(0) = 0.^^ 

To see that our conditions are sufficient for the smooth extendibility 
of p to r = 0 consider the coordinate transformation = r cos 9 cos (f>, 
—T cos 9 sin <l>j =r sin 0. Then the metric is of the form 



9 - e^-dt" + 



o2A , 






^ x^x^dx^dx^ + ^(da:^)^ 



i=l 



Observe first that there are smooth functions A in a neighbourhood of 
{x^^x^^x^) = 0 with 



i>{x^ ^ x"^ j x"^) = !/{ 



\ 



1 

i=l 



^(a:»)2), A(x\x^,x^) = A( 









i=l 



if and only if and A are even functions. Assume now that A is even. 
Then the Taylor series of - 1 is a series in the variable 
The equation A(0) =0 implies therefore that the quotient 



(x^,x^^x^) 



^2A 






-1 






well defined at (x^^x^,x^) = 0 and smooth. 

If A(0) > 0 we would get the analogue of the tip of a 3-dimensional cone. This 
can be visualised in the 2-dimensional case with the sphere being replaced 
by a circle. 
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We obtain A(0) =0 from Equation (7.5.26) since m vanishes to third 
order at r = 0. The function A is even because of (7.5.26) and the fact 
that the integrand e ^ of m is even. 

Equations (7.1.5) simplifies to 

^ - 47rr (p - = 0 

\ Stt J 



which, using e = 1 — 2m/r, is equivalent to 



o , ^ {p - £) 

^ r(r — 2m) 

We can therefore determine Q up to a constant of integration. Observe 
that 1 / is even since the function 

{p - £) 

r(r — 2m(r)) 

to be integrated is uneven. The Tolman-Oppenheimer- Volkov equation 
implies now the equation of motion (7.1.8). Since this equation is inde- 
pendent from Equations (7.1.3)-(7.1.5) but Equation (7.1.8) is a conse- 
quence of Equations (7.1.3)-(7.1.6) we can derive Equation (7.1.6) from 
the system of Equations (7.1.3)--(7.1.5), (7.1.8). Let 



mo 



lim m(r) — 
r— >ro>r<ro 



6 ■ 



If we extend A beyond ro using = (1 — ^ fox r > tq 

then A is continuous. It is Ci if and only if 

ylr^ 

lim r ro , r < roQ • m(r) = — — . 



Since Equations (7.1.3)-(7.1.6) are satisfied we can appeal to Lemma 
7.1.3 and infer that A is if and only if 47rro^(e(ro) -f ^) = 
which is equivalent to e(ro) ~ 0. We choose the constant of integration 
for 1 / such that we have == 1 — and extend the function i/ 

ro o 

such that — 1 - ^ for r > ro It follows that i/ is continuous. 

The function u is if and only if 



We calculate 



lim dr{e^^) 
r— >ro,r<ro 



2mo 



2Arp 
3 ■ 



lim dJe^’') = 2(e2‘^('’<'>) 
r^ro,r<ro 



lim drl^ 
r— tro,r<ro 
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= 2(1 - N ^(^o) + ^^^0^ (p(^o) ~ £) 

ro ro(ro - 2m(ro)) 

2 / Aro^ . 3 / \ ^ro^\ 

^ ~ ^(’^o) - — j • 

Hence z/ is C^ if and only if p(ro) ~ 0. 

Uniqueness follows from Theorem 7.4.1. I 

Observe that we have used all our freedom in order to construct a spheri- 
cally symmetric spacetime with exterior vacuum solution. In general, 
it is impossible to achieve higher differentiability. Even if staticity is 
not assumed, a smooth matching of the interior solution to the vacuum 
spacetime would imply properties of the boundary which are so strong 
that they could be used in order to calculate the collapse of the star 
explicitly (Kriele 1995). 

Independently of the pressure, there is an upper mass limit for spher- 
ically symmetric stars. 

Theorem 7.5.2. Let {M^g) be a spherically symmetric static, perfect 
fluid spacetime with positive, radially decreasing energy density (e > 0, 

i<o;. 

(i) Let Vc > 0. Then the mass associated with the radius rc satisfies 
the inequality m{rc) < 5rc/9. If, in addition, we assume that p — 
~ > 0 then the more stringent inequality m{rc) < 4rc/9 holds, 

(ii) Assume that there is an equation of state p: e t-> p{e) and let 
€c > 0. Then there is an rric G M which only depends on the equation 
of state for low energy densities, p: [0, ej ^ M such that m(r) < me 
for all r e 

Proof Equation (7.1.6) is equivalent to Q • Q • + (Q • ^)^ = -2m/r^ + 
47r(e +p) which (together with Equation (7.1.8)) implies 

<3 . = e- (i + P)) - 




where for the last equality we have used Equation (7.1.5). Since de/dr < 
0 we have 

m(r) — Att J ^e(r) + f^dr 
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> 47t ^e(r) 4- J f^df = 47 t ^e(r) + r^/3. 

Hence Q • <0 and an integration yields 

T Tq 

for r < Tc. From Q»v = • r we obtain 



de'' ^ _ , 

We re-express Q^r in terms of m using m = |(1 - and integrate 

the resulting equation. This gives 






{Q • 

Tc Jo 




(7.5.27) 



In order to estimate this integral we show first that m(r) > m{rc) j 
for all r e (0,rc). Comparing the derivative of both functions we obtain 
that the function 



/W = ^ yMr) - 

= r^(47re(r) 



m{rc)r^ 



= 47T6(r)r^ ” 3 



m{rc)r 



Zm{rc)/rc^) 



satisfies = f/(^) + 47rr^^^ < f/(r). Thus if there is an ri € 
(0,rc) with /(ri) < 0 then /(r) < 0 for all r € (ri,rc). Because of 

m(0) = 0 the existence of an € (0,rc) with m(r 2 ) < m(rc) 
implies the existence of an ri e (0,r2) with /(ri) < 0. Since /(r) < 0 
for all r € (ri,rc) we obtain m(r) < m(rc) for all r € {r 2 ,rc) in 

contradiction to m(rc) = m(rc) (^) . Hence such an V 2 cannot exist 

which implies our estimate m(r) > m(rc) (^) for all r € (0,rc). 
Inserting this estimate into Inequality (7.5.27) we obtain 











(Q • rc^ 

Tc 2m(rc) 
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and therefore (using Equation (7.1.5)) 





(7.5,28) 



where we have set Tc = y 1 - Xhe right hand side is a third 

order polynomial in Tc* The zeros of this polynomial are 

1 , i (-1 ± . 

It follows that the right hand side of Inequality (7.5.28) vanishes for 

m{rc) e |o, |ro(l - 67T{r,f (p " ^) ± + 67r(re)2 (p - ^)) 

Since m(rc) is positive we obtain the mass-bound 




where x = yl 67r(rc)^ {p — ^) . Since the function m— ^ 2 ~ + x has 

its minimum at a; = 1/2 it follows that m{rc) < 5rc/9. If we impose in 
addition the energy condition p ~ ^ > 0 then we get rric < ^rc/9. This 
proves the first assertion. 

For the second assertion observe that m(rc) > |7r(rc)^ (^(^c) + 
since e is radially decreasing. This inequality and the upper bound for 
m(rc) restrict all possible values (r^ m(fc)) to a compact subset C C 
which depends only on e(rc),p(rc), and A. We can now solve Einstein’s 
equations for r > Tc using the known equation of state. Clearly, m(r) 
depends continuously on the data rcjm(rc). This implies that rric := 
sup{m(r) : (rc,m(r*c)) G C] is finite. I 

Theorem 7.5.2 gives an important indication for the existence of singu- 
larities in our universe. In Sect. 7.2 we have seen that all non- trivial, 
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maximally extended, non-flat spherically symmetric vacuum solutions 
with non-constant r of Einstein’s equation fail to be static in a subset of 
spacetime and contain a region where curvature diverges. We saw that 
it is possible to enter this region but impossible to leave it. Moreover, 
once having entered the region any observer will fall into the singularity 
(where curvature is infinite) within the finite time span 7rm where m 
is the Schwarzschild mass. The question arises whether this property of 
the vacuum solution also occurs for real stars which have non- vanishing 
energy momentum tensor. Since in Newtonian gravity we also have a 
central singularity in the vacuum case which can be avoided if the mat- 
ter of the star is not assumed to be concentrated in a single point, it is 
tempting to argue that the property of the Schwarzschild solution is an 
artifact of the vacuum equation. 

Theorem 7.5.2 indicates that this is not the case. We have proved that 
there is an upper limit for the concentration of matter in static, spheri- 
cally symmetric perfect fluid stars, m/r < 5/9, if the energy density of 
the star decreases outward and is positive. These physical assumptions 
are very weak and seem to be satisfied for all known objects. Moreover, 
we have seen that for any star which is governed by an equation of state 
there is an absolute mass limit. What is more, this mass limit can be 
estimated using only the equation of state for low energies. This means 
that we get bounds even if we do not know the physical configuration of 
extremely dense stellar cores. 

It has been shown^^ that there are stars which exceed the mass limits 
given in this section. This indicates that these stars will collapse into 
black holes once they have exhausted their nuclear fuel. In Chap. 9 we 
will give a very general argument to the same extent which does not 
rely on spherical symmetry. It should be noted however, that all these 
arguments in favour of the existence of black holes have loop holes. In 
this section, we heavily rely on spherical symmetry and the assumption 
of a perfect fluid. Moreover, it is conceivable that there are non-singular 
solutions which fail to be static. There are other loop holes in Chap. 9 
which we will address then. 



13 



The argument uses input from physics which is beyond the scope of this 
book, cf. (Hartle 1978) 
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In this chapter we link the concept of causality to the conformal struc- 
ture induced by the metric and present some elementary causal prop- 
erties and their interpretation. 

In Minkowski spacetime, causality is trivial since lightlike geodesics 
are straight lines. Lemma 3.1,4 shows that the local causal structure 
of arbitrary Lorentzian manifolds is the same as the causal structure 
of Minkowski spacetime. All non-trivial aspects of causality are there- 
fore global in character. In this chapter we will also discuss in detail 
the possibility of ^^causality violation*^ due to the global geometry of 
spacetime. 

Chapter 8 requires Sect. 3. 1 and develops rather specialised mathe- 
matical techniques. It contains a number of technical results which are 
needed in Chap. 9 where the existence of singularities in generic space- 
times is proved. We will restrict to those results which are necessary to 
prove and interpret these singularity theorems. For a more compre- 
hensive mathematical treatment of causality see (Beem and Ehrlich 
1981; Hawking and Ellis 1973; Penrose 1972). For more examples 
which exhibit the subtleties of causality and singularity theorems see 
(Senovilla 1998). 

According to our experience no signal is faster than light (photons). As 
we assume that photons move along null geodesics, the integrated light 
cone Cx = expa.({uar 6 TxM : g{vx^Vx) — 0,Vx future oriented} should 
(at least locally) determine which events can in principle be influenced 
from a given event a:. It is therefore plausible to identify the conformal 
structure € of spacetime with the causal structure of the universe. 

Postulate 8.0.1 (Causality and conformal structure coincide). 

X G M can causally influence y e M if and only if y G (x) . Material 
objects can reach y from x if and only if y G 

A proper justification of Postulate 8.0.1 would require a corresponding 
theory of physical particles and fields. This is far beyond the scope of 
this book. While Postulate 8.0.1 will not be important for our theorems, 
it is crucial for their physical interpretation. 

Remark 8.0.1. For arbitrary matter models, Einstein’s equation does not 
respect the link between the light cone structure and causality. For in- 
stance, a spherically symmetric fluid with equation of state e p{e) 
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satisfying dp/de > 1 has spacelike characteristics (cf. Corollary 7 . 4 . 1 ). 
Consequently, the characteristics of the initial value problem associated 
with Einstein equation is spacelike whence perturbations of the ini- 
tial data propagate faster than light For this reason one usually regards 
these matter models as unphysical. In fact, to date all classical (i.e., 
non-quantum) matter models which describe real matter have causal 
characteristics. 

The local causal structure of any Lorentzian manifold is trivial, i.e. the 
same as in Minkowski spacetime. This follows immediately from Lemma 
3.1.4 which is fundamental to this chapter. The following technical corol- 
lary will also be useful. 

Corollary 8 . 0 . 1 . Let (M,g) be a Lorentzian manifold and C a convex 
neighbourhood of x ^ C, Let K C C compact and 'y be a causal curve in 
K. Then 7 is extensible. 

Proof, Let 7 : [a, 6 ) C be a future directed causal curve in K, The 
curve 7 can be future extended if lim^-^b ^{t) exists. In order to see that 
this limit exists, let { 7 (ti)}i€N? { 7 (^;)}i€N be convergent sequences with 
lim^-^oo U = limj_,oo sj = b and x, y be their limit points. For any i 
there is a j > i with j{tj) € J“^( 7 (si),C) and for any j there is an 
i > j with 7 (si) e J~^{j{tj),C), Hence we obtain x € J“^(y,C) and 
y e J'^(a:,C). Hence by Lemma 3.1.4 (i) there are two future directed 
causal vectors v,w with x ~ expy{v) and y = exp^(ty). Traversing the 
geodesics t expj^(tu) backwards we see that at x there is also a causal 
past directed vector u with exp^(u) = y. Since the exponential map exp^, 
is a diffeomorphsim of an open set C C T^M to C we must have w = u. 
But this is only possible if both vectors vanish. I 



8.1 Causality conditions 

It is easy to mathematically construct a spacetime with closed timelike 
curves. At first glance one is tempted to rule out such spacetimes since 
it seems possible to perform experiments in them which lead to logical 
contradictions. In this section we will investigate this issue in some 
detail. We will also define a slightly stronger ‘‘causality condition” 
which will play an important role in subsequent sections. 

In a general Lorentzian manifold, it is possible for closed timelike curves 
to exist. 

Definition 8 . 1 . 1 . Letx € M. We say that causality (resp., chronology^ 
is violated at x if and only if there exists a closed, non-trivial causal 
(resp., timelike) curve from x to x. The chronology violating set is given 
by 
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{x € M : X e 

and the causality violating set by 

{x E M : 3 a non-trivial causal curve 7 from, x to x}. 

A Lorentzian manifold (M^g) is causal (resp. chronological j if the causal- 
ity violating set (resp., chronology violating set) is empty. If {M,,g) is 
chronological (resp, causal)^ we sometimes say that the chronology con- 
dition (resp, causality condition^ holds. 

The term ^causality violation’ is somewhat misleading: the possibility of 
closed causal curves is not contradictory itself and there are no mathe- 
matical arguments against causality violation. 

The idea that there may be closed timelike curves in our universe is 
not new: The concept of cyclic time was a widespread idea in ancient 
Greek philosophy (Kanitscheider 1984, p. 45). These Greek philosophers 
accepted our fundamental experience of local linearity of time but they 
compactified the time line to a time circle. Its circumference was identi- 
fied with the time of one revolution of the universe (according to their 
model of planetary motion or just according to ‘arbitrary’ laying down^). 
The sheer length of this period is sufficient to explain why nobody of us 
ever has reentered her/his own past. 

We can easily obtain a spacetime whose causal structure is analogous 
to the causal structure of this ancient Greek model. Just take a horizontal 
strip of 2-dimensional Minkowski space and identify the upper and the 
lower boundary (cf. Figure 8.1.1^). Another very instructive example is 




Fig, 8.1.1. A strip of two- 
dimensional Minkowski 
space where future and 
past boundaries are 
identified. 



the Lorentzian manifold 

^ Plato, for instance, chose 10,000 years (Kanitscheider 1984, p. 55) 

^ The arrows in this and other figure indicate how both sides to be identified 
are oriented 
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(R X 5^, 2d(^dt + 

first given in (Misner 1967) (cf. Fig. 8.1.2). The chronology violating set 
is given by ip) : t < 0}. 




Fig. 8.1.2. Misner^s spacetime (5^ X R, 2dtdp + tdp^) 



The “time compactification” arising in these examples is trivial in the 
sense that in both examples there is a locally isometric Lorentzian man- 
ifold which satisfies the chronology condition. In Lorentzian geometry 
however, there also exist non-trivial examples where causality violation 
arises geometrically and not merely topologically. An example which will 
also be of importance in Chap, 9 is the Godel solution (Godel 1949). 

Example 8A,1. The Godel solution describes a solutions of Einstein’s 
equation with dust and postive cosmological constant, Ric Ag = 

87reu^ 0 where w is a timelike unit vector field and e — 7l/(47r). The 
metric is given by 

^ ^ ^ - d^^ -f dr^ H- (sinh^(r) ~ sinh^(r))d</?^ 

4* 2\/2sinh^(r)dv?dt^ -f dz^, 

where we have r > 0 and identify p with p + 27t. The vector field 
has closed integral curves and it is spacelike for r < arsinh(l). For r = 
arsinh(l) the integral curves of dp are lightlike (but not null geodesics) 
and for r > arsinh(l) they are timelike. Since sinh^(r) is an even function 
of r it follows that at r — 0 we have only the usual coordinate singularity 
associated with polar coordinates. Hence spacetime has the topology 
R^ and is in particular simply connected. It follows that chronological 
violation is an inherent property of the solution. 
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A physically interesting solution of Einstein’s vacuum equation with van- 
ishing cosmological constant is the Kerr solution. For details cf. (O’Neill 
1995), (Wald 1984), (Hawking and Ellis 1973). 

Despite the existence of these examples most physicists regard causal- 
ity violation as ‘unphysicaP. The reason for this rejection of causality 
violation is the following thought experiment: 

Suppose, you are travelling in spacetime and reach a point in 
your own past before your departure. Now you decide not to 
travel after all and instead to stay home. Contradiction. 

At a first glance, the possibility of “free will” seems to be at the centre of 
the issue. However, following (Wheeler and Feynman 1949) Clarke (1977) 
has re-formulated the thought experiment in terms of a simple machine 
and has argued that the thought experiment is fallacious: Assume that 
there is a gun directed at a target in spacetime. This target is connected 
with a shutter which, if closed, blocks off the path between the gun 
and the target: If the gun is triggered, the bullet will hit the target 
which in turn will cause the shutter to fall. A second shot will now be 
blocked by the shutter and therefore cannot hit the target (c.f. Fig. 8.1.3) . 
Now assume that the configuration is located in a region with causality 
violation such that the shutter falls along a closed timelike curve so that 




it blocks the bullet before the gun had been triggered. Again we seem 
to arrive at a contradiction: If the shutter is open the bullet can hit the 
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target. But the target closes the shutter which in turn blocks the path 
of the bullet. 

This paradox may be resolved as follows. The angle a under which 
the bullet is deflected by the shutter depends continuously on the shut- 
ter’s position X at the time the bullet passes the shutter. For simplicity 
we assume that the shutter will descend with constant velocity v. This 
velocity is continuously related to the angle a. If the length of the closed 
causal curve is T we obtain the relation x — Tv[a{x)). This equation 
has at least one solution xq which leads to a contradiction-free situation. 
In physical terms this can be explained as follows: The original contra- 
diction is due to the fact that the shutter is thought to be either up or 
down. However, the position of the shutter depends continuously on the 
parameters of the system. What happens is that while the shutter de- 
scends it grazes the bullet and thereby deflects it so that the mechanism 
works only imperfectly. As a consequence, the shutter is released rather 
late and not yet in place when the bullet hits it again due to causality 
violation. Hence it grazes the bullet and we are in a paradox-free time 
loop. 

This scenario appears to be highly non-generic but Clarke argues 
that exactly this is the effect of causality violation: It picks out those 
non-generic data which are in accordance with the causal anomaly. The 
gist of the argument rests on the assumption that physical processes are 
continuous, an assumption which does not hold for quantum mechanical 
systems. These systems may be in discrete pure states such as spin up or 
spin down. However, if one tries to set up a quantum thought experiment 
one is faced with the fact that all predictions are probabilistic which 
invalidates the whole thought experiment from the outset. 

There are also arguments against Clarke’s resolution of the paradox. 
Instead of releasing the shutter directly when the target is hit we may 
have a device which automatically releases the shutter a certain time 
after the impact. This can be achieved with an electronic switch rather 
than a mechanical connection between target and shutter. It seems now 
much less probable that this device always releases the shutter such that 
it grazes the bullet when coming down. For Clarke’s argument to work 
the bullet must comes out of the gun so slowly that it just touches the 
target but does not really hit it. Otherwise it cannot be explained that 
the second device is not successful in releasing the shutter at the pre-set 
time which would lead to a contradiction. 

Whether Clarke’s argument is correct or not — we are only able to 
conduct local experiments. But causality violation is a global effect, and 
so the lack of experience cannot give evidence of its absence. Any objec- 
tion against causality violation rests on an (unjustified) extrapolatiom 
of every-day experience. 
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There is another point which should be addressed. Causality vio- 
lation seems to constrain free will. While this is not really a physical 
problem, such an effect would have some bearing on philosophical and 
moral questions. But an almost trivial observation resolves any possible 
argument concerning free will at once: If we want to incorporate the no- 
tion of free will into a physical description we have to view it at least as 
a quantum effect (or caused by another yet undiscovered ‘mechanism’), 
but certainly not as something fitting into the framework of classical 
physics. We only can expect that general relativity is a classical limi t of 
such a theory. It is therefore quite possible that ‘free will’ is something 
like a second order effect and that the classical “limit-spacetime” of our 
world contains closed timelike curves even though we still enjoy free will. 

With this discussion in mind we should always be very watchful if 
in order to obtain physical results the seemingly innocent assumption of 
chronology has to be made. 

Lemma 8.1.1. The chronology (resp., causality) violating set consists 
of connected components of the form I'^{xi) n I~{xi) (resp., J~^{xi) n 
J~{xi)) ('i = 1,...;. 

Proof. We only show the lemma for chronology violation. The proof for 
causality violation is completely analogous. Let (7 be a connected com- 
ponent of the chronology violating set and x eC. Since C is connected 
there is for each pair of points {x, j/} c C a (not necessarily causal) curve 
j C C which connects x and y. Since for all 2 e C the set I'^iz) is a 
neighbourhood of x and the curve 7 is compact, there are finitely many 
Zi € 7 such that Zi+i e I'^{zi) and the neighborhoods /'*'(zi) cover 7. It 
follows that there is a timelike curve from a: to j/. By the same argument 
there is a timelike curve form y to x. Hence C Cl'^ {x) n I~ {x) and the 
assertion follows since I'^{x) n I~{x) is connected. I 

The following proposition shows that a compact spacetime cannot be 
chronological. 

Proposition 8.1.1. If M is compact then the chronology violating set 
of M is non-empty. 

Proof. We can cover M with finitely many sets of the form I'^{xi) {i = 
l,...,k). If a;i is not contained in I'^(xi) there is an ^ € {2 , . . . , A:} and 
a permutation cr with xi € /+(a:a(o), xi i \J^r\ /+(x<,(^)). This implies 
^ Ui=i since otherwise we would have 

i~i 1-1 

XI € /+(x,(^)) C 7+(U /+(x,«)) = [j /+(x,,(i)) 

2=1 2=1 

in contradiction to the definition of x;. If x; ^ /+(xj) we can apply the 
same argument to x/ instead of xi. Since there are only finitely many 
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Xi^ one of the Xi must be in its own future for otherwise we would have 
that is in none of the I 

Proposition 8.1.1 is often taken as a reason for dismissing compact space- 
times as unphysical. 

While the chronology condition and the causality condition are very 
intuitive, from a technical point of view, a slightly stronger condition is 
advantageous: 

Definition 8.1*2, The strong causality condition holds at x e M if for 
any neighbourhood V of x there is a neighbourhood U CV of x such that 
any causal curve intersects U at most once. 




Fig. 8.1.4. A spacetime which 
is causal but fails to be strongly 
causal 



In other words, if the strong causality condition does not hold at x, there 
are timelike curves starting at x which come arbitrarily close to x after 
leaving a giving convex neighbourhood. Hence the chronology condition 
is almost violated. In the next section we will see the importance of this 
causality condition. 

Finally, we wish to introduce global hyperbolicity^ ^ the strongest causality 
condition which is often assumed. Its relevance stems from the fact that 
in a globally hyperbolic spacetime the set of causal curves connecting 
two given points is compact in a natural topology. We will use related 
properties in the next section. 

Definition 8.1.3. A subset of A C M is said to be globally hyperbolic if 
A is strongly causal and for any two points € A the set J~^{x)C\J~ {y) 
is compact 



^ This name has been coined by Leray (1953) in connection with systems of 
partial differential equations. 
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Fig. 8.1.5. A spacetime which 
is strongly causal. An infinitesi- 
mally small perturbation of the 
metric results in a spacetime with 
chronology violation 

8.2 Cluster and limit curves 

In this section we study sequences of causal curves and there limits. 

The results of this section will be fundamental to what follows. It is 
based on (Beem and Ehrlich 1981). 

It is practical to generalise the concept of a timelike or causal curve 
to continuous curves. 

Definition 8.2.1, A continuous curve 7 is called causal (resp.j time- 
like ) if every point x on j has a convex neighbourhood C such that any 
point y ^ X on 7 n C can be connected by a causal (resp,, timelike) 
curve which is contained in C. 

Clearly, this definition coincides for C^-curves with our previous Defini- 
tion 3.1.3 (iii). 

Lemma 8.2.1. Let x e M. There is a convex coordinate neighbourhood 
of C of Xj a constant k > 0 , and coordinates (x^, . . . such that all 

causal curves ^ in C can be parameterised by t ~ x^ and the coordinate 
inequality 

n 

^ (7“(t) - 7“(s))^ <k\t-s 

a=0 

holds for allies. 

Proof. We choose coordinates (x^, . . . in a convex neighbour- 

hood C of X with compact closure such that dx^ is timelike and all 
dx^ {i € — 1}) are spacelike. Then any causal curve in C can 

be parameterised by x^. Let ^ be a causal C^-curve with ja{t) = 7(t) 
and p{s) = 7(5) (cf. Fig. 8.2.1). Since the closure of C is compact there 
exists a constant ko > 0 such that all causal vectors are also causal with 
respect to the flat metric -kodt'^ particular, p satisfies 
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Fig. 8.2.1. The proof of Lemma 
8.2.1 



ko ~ denote the standard basis of by 

{eo, . . . , e„_i} and write ||u“e„||2 = \ZE”=o(^“)^ we obtain 



\\l°'{t)e-a - 7“(s)eo||2 — IIm“(^)6o - M“(s)ea||2 = II / A^Md^eal 

J S 

~ Is < \/l + fco(t - s). 



Corollary 8.2.1. It follows that causal curves are Lipschitz and there- 
fore differentiable almost everywhere. 

For x,y £ M let Causal space of continuous causal curves 
from X to y and C^j^^{x,y) be the space of timelike curves from x to y 
which are . We will now specify a natural topology for the space of 
causal curves. 

Definition 8.2.2. Let 7: [0,6] ^ M, 7i : [a,6] -1 M (ieN) be curves. 
The sequence {7j}ieN converges to 7 in the C°-topology if for every 
neighbourhood V of j in M there exists an io eN such that ji CV for 
all i> io. The curve 7 is called the limit curve of the sequence {7i}i6Pj. 

Our terminology is slightly at odds with the traditional definition of 
“limit curve” in general relativity but closer to generic mathematical ter- 
minology. Often, not limits of curves (with respect to a natural topology) 
but curves which are better thought of as a set of pointwise accumula- 
tion points are called “limit curves” . We will reserve the term “cluster 
curve” for such accumulation curves: 

Definition 8.2.3. Let 7: [a, 6] — > M, 'ji,: [a, 6] M (i € 6e curves. 
7 is said to be a cluster curve of the sequence {7i}igN if there exists a 
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Fig. 8 . 2 . 2 . A limit curve 7 
of a sequence of curves jn 



subsequence such that for all x e j each neighbourhood of x 

intersects all but finitely many of the curves 7 ^ . . Following Beem and 
Ehrlich (1981) we will say that the sequence { 7 i^}j€N distinguishes the 
cluster curve 7 . 

It will turn out that for strongly causal spacetimes cluster and limit 
curves are essentially the same (cf. Theorem 8.2.2) below. There is no 




Fig. 8.2,3. An example where a limit 
curve from rr to y is not a cluster curve 
for a sequence of points from x to y 



strict logical relation between limit curves and cluster curves. Consider in 
the flat cylinder {S'^ x E, d(/?dt) (cf. Fig. 8.2.3) the sequence of identical 
curves 7 ^ which connect x with y and satisfy cp ~ const. The curve 
7 which first traverses t = t{x) and then connects x with y satisfying 
ip — const is a limit curve of 7 ^ but it is not a cluster curve. In Fig. 8.2.4 
we have a cluster curve 7 of a sequence of causal curves which is not 
a limit curve. In general, a cluster curve of a sequence of causal curves 
may even be spacelike (cf. Fig. 8.2.5 

Proposition 8 . 2 . 1 . If {M^g) is strongly causal and y is a cluster curve 
of a sequence { 7 i}^eN of causal curves^ then 7 is causal 

Proof. Since (M, g) is strongly causal, we may cover 7 by convex neigh- 
bourhoods Ci such that no causal curve can enter any of these neigh- 
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Fig. 8.2.4. A spacetime 
which is causal but fails to 
be strongly causal 




Fig. 8.2.5. Assume that 
h/ CIS rational and a/h is ir- 
rational. Then the projec- 
tion of the line with slope 
c/a from to the torus 
depicted in the figure is a 
dense curve 7 . Hence every 
curve is a cluster curve of 
7 



bourhoods twice. Consider one such neighbourhood and denote it hy C. 
Let x^y he points in 7 Pi C and denote by { 7 ^^. }j^n a subsequence which 
distinguishes 7. Then there are sequences Xj,yj G ji- DC with Xj x 
and yj — > y. Since yi. G J'^{xi.^C) Lemma 3.1.4 implies the existence of 
a causal vector Vj with exp^. {vj) — yi ,. . These vectors have an accumu- 
lation point V with exp^{v) — y. The vector v must be causal since the 
set of causal vectors is closed. But this implies y G J^{x^C). If x and 
y are arbitrary points on 7, we can find finitely many neighbourhoods 
Ci such that the segment from x to y is covered by \J^Ci. We can now 
apply the preceding argument finitely often to conclude that x and y are 
causally related. I 

The basic Lemma 8.2.4 below is an application of the theorem of Ascoli 
which we will present first. 

Let A C be a compact set. The space of continuous functions 
= {/: A : / is continuous} 
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can then be regarded as a normed vector space in a natural way. Just 
ll/lloo = sup^gy^{|/(a:)|}. Moreover, this norm is complete, i.e., ev- 
ery Cauchy sequence {fi} C converges to a function / G 

R^). (/ can be constructed pointwise using the completeness of 

RK) 

Lemma 8.2.2. Let B C C{A,R^) be a dosed set and assume that for 
every e > 0 there are finitely many balls . . . , with 

radius e and B C Ui=i Then B is compact 

Proof. If the lemma is not true then there are open sets U^, {t e I) 
covering B such that no finite subset covers B. Let 

be a finite set of balls of radius 1 which cover B. By our assumption 
one of these balls cannot be covered by finitely many Ut,. (Otherwise we 
would obtain a finite cover of B by finitely many sets which are in turn 
finitely covered by sets Ut,). We denote this ball by Sq. Assume that we 
have constructed balls {Bi}i=o,...,A:-i such that 

(i) Any two consecutive balls intersect, 

(ii) Each ball Bi has radius 2"% 

(iii) None of these balls can be covered by finitely many Ut,. 

There are balls B\_k{xi), . . . , ^ {^k{ 2 -^)) which cover B and there- 

fore also Bk~i^ Since B^^i cannot be covered by finitely many Ui there 
must exist at least one B‘^k{xm) which intersects Bk~i and cannot be 
covered by finitely many Ut,. Denoting B'^_k{xm) by B^ we have induc- 
tively defined a sequence {Bi}i^f>^uo of balls which satisfy (i)“(iii). Denote 
the centres of these balls by Pi. For any natural numbers m < n we obtain 

\\yn-ym\\< E \\yi-yi-4< E (2-* + 2-*+i))<2-2-"‘. 

2=m+l i— m+1 

Hence {yi}i^mo is a Cauchy sequence. Denoting its limit by y there is 
an Wto which contains y and a number r G N such that the ball B 2 -r{y) 
is contained in But this implies B 2 -v-i{yr-\-i) C Ut, in contradiction 
to (iii). I 



Theorem 8.2.1 (Ascoli). Let A CR^ be a compact subset and fi: A ^ 
R^ (i eN) be an equi- co ntinuous seq uence of continuous functions such 
that for all a e A the set /^(^) compact. Then there is a contin- 
uous function f: A M and a subsequence {fi.}j^^ of {fi}i^^ which 
converges uniformly to f. 
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Proof. We show first that the subset USil/O is compact in the normed 
space II -||oo). By Lemma 8.2.2 we only have to show that for 

any e > 0 there is a finite number of balls with diameter less than e which 
cover Let e > 0 and a G A. Since {fi}ien is equi-continuous, 

there is for each a e ^ a neighbourhood Ua of a such that for all fj 
and all y e Ua the inequality \\fj{a) - fj{y)\\oo < e /4 holds. Since A is 
compact, we can cover A with finitely many such neighbourhoods Ua^ 
k}). Since the union 

k ~oo 
i=li=l 

is compact it can be covered by finitely many open balls of radius e/4. 
Denote their centres by e K (s € {1, ... ,r}). We will now construct 
finitely man y neighbo urhoods of diameter less than e in Cfi{A, which 
cover all of These neighbourhood will be defined by the re- 

quirement that the functions / in each such neighbourhood have values 
X = f{a) close to Xg for all a near some a*. More precisely, consider the 
finite set of all maps a : {1, . . . , A;} — > {1, . . . r} and let 



Va=^lheC\A,W)f)[j{fi}-. 

[ i=l 

||/i(oj) -x„.(i)|| < e/4 Vf € fc}|, 

/ ^ Since K is covered by finitely many balls xu,,.,Xr of 

radius 64 there is for each I an Xs^ such that |l/(a/) -Xg, || < e/4. Defining 
a{l) = Si we see that / € V^. Hence the sets Va cover all of U^i{/ 0 - 
Since the sets , . . . , Uat, cover A there is for h,h GVo- and a € ^ an 
Z € {1, . . . , fc} such that \\h{a) — h{ai)\\ < e/4 and \\h{a) — ^(a^)|| < e/4. 
This implies 

\\h{a) ~ ft(a)|| < 11%) - h{ai)\\ + ||%) - h{ai)\\ 

+ il/i(aO ” Xa(i) II + \\Xa(i) ~ h{ai)\\ 

<4-e/4 = e. 

Hence ea ch set Va is contained in a ball of radius e. This implies that 
UiZiifi} IS covered by finitely many balls of radius e and therefore com- 
pact. 

The assertion follows now since in a compact subset of a normed 
space every sequence has a convergent subsequence. I 
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Lemma 8.2.3* Let C be a convex coordinate neighbourhood with com- 
pact closure and let {7i}ieN be a sequence of causal curves in C which are 
inextensible in C, If x e M is an accumulation point of this sequence, 
then there is a causal cluster curve 7 through x which is inextensible in 
C. 

Proof By considering a subsequence we can assume without loss of gen- 
erality that for each i there is a U such that 7i(^i) x. Any cluster 
curve 7 of this sequence intersects x. We choose the same coordinates as 
in Lemma 8.2.1 and view the curves 7^ as continuous maps from finite 
intervals [a^, bi] to In order to apply the Theorem of Ascoli 8.2.1 we 
trivially enlarge the domains [ai,bi] to [inf^er^(ai), sup^^f5^(&i)] by setting 
7i(^) = liiO'i) ^ ^ [^1 ^i] 7i(t) = 7i(6i) for t G [bi, b]. This family of 

maps is equi-continuous by Lemma 8.2.1. Since C has compact closure so 
has the set UieN7^(^) theorem of Ascoli 8.2.1 implies that 

a subsequence of these curves converges uniformly to a continuous curve 
7. To see that 7 is causal let t,s e [a, 6], s <t. Since ji{t) e 
for all i, there is a causal geodesic pi from 7^(5) to 7i(i) for all i. By the 
continuous dependence of the solutions of ordinary differential equations 
with respect to initial values, there is a limit geodesic fa starting at 7(5). 
It is clearly causal and has future end point -y{t) by the convexity of C, 
This implies j{t) e J‘^(7(s),C). Since s and t were arbitrary, '^must be 
a causal curve. Since 7(a) is an accumulation point of 7i(a) e C\C this 
point must also lie in C \ C. Analogously for 7(6). This implies that 7 is 
inextensible in C. I 



Lemma 8.2.4. Let {7i}igH ^ sequence of future inextensible causal 
curves. If x G M is an accumulation point of this sequence, then there 
is a causal, future inextensible cluster curve 7 through x. 

Proof, Let Cx be a convex neighbourhood as in Lemma 8.2.3. Then 
there exists a subsequence which distinguishes a cluster curve which is 
inextensible in Cx^ We may now take the intersection Xi of the future 
boundary of Cx with this cluster curve. Applying the same argument 
to a convex neighbourhood Cxi of xi and the subsequence, we obtain 
an extension of the cluster curve to Cx U Now we can proceed by 
induction. I 



Proposition 8.2.2. Assume that (M, g) is globally hyperbolic and let 
{yi}i€N be sequences of points converging to x,y e M such that 
Pi € J'^{xi), y € J~^{x), Let 7^ be a causal curve from Xi to pi. Then the 
sequence {7i}i€N has a cluster curve which connects x with y. 

Proof. Let x € I~{x) and y G The set J~^(x) H J~{y) is then a 

neighbourhood of x, y and we can assume without loss of generality that 




372 8. Causality 



all lie in {x) n J (y) . Since (;r) D J (y ) is compact, we can extend 

all 7i such that they are inextensible in J^ {x) fl J~(y). By compactness 
and strong causality of J+(i) n J”(y) we can cover this set by finitely 
many convex neighbourhoods as provided by Lemma 8.2.3 such that no 
7i intersects any neighbourhood twice. Now the claim is an immediate 
consequence of Lemma 8.2.3. | 

Theorem 8.2.2* Let (M,y) he strongly causal For given points x,y 6 
M let ji : [a, b] M be a sequence of causal geodesics with 

lim 7^a) = x and lim 7^(6) — y. 

+00 i~~^oo 

Then 7: [a, 6] — > M, 7 € C^3^jgg|(x,y) is a cluster curve of the sequence 
{7i}iGN if o.nd only if there is a subsequence {7^ of {'yi}ieN which 
converges to 7 in the topology of curves. 

Proof Assume first that 7 is a cluster curve of {7^}ieN' Since the image 
of 7 is compact, there are for each neighbourhood ZY of 7 finitely many 
convex sets such that 

(i) 7 C U/c=:l C W, 

(ii) no causal curve can enter any Ck twice. 

We have to show that there is a subsequence {^i. such that for suffi- 
ciently large j all 7^ . are contained in W. We can choose a finite sequence 
of points {'y{tk)}k^i,...,k with 7(ffc) ^ Ck r\Ck-\-i^ By the definition of 
a cluster sequence (and the fact that there are only finitely many such 
points) there is a subsequence {7^^. } such that each ji. intersects all 
of the sets Ck H Cfc+i. Since all Ck are convex and no causal curve can 
re-enter any Cfe, 7*,. is contained in Ui=i CU. 

Conversely, assume (without loss of generality) that {'ji}ieN con- 
verges to 7 in the topology of curves. Let /i be a fixed Riemannian 
metric on M and let for r € N Ui be a neighbourhood of 7 which is 
the finite union of convex sets such that no causal curve 

can enter any Ck^i twice and all 1 have diameter less than ^ with 
respect to h. We can extend the causal curves ji to inextensible curves 
in Ufe=i^A;,e* Now let {Wi}^eN be a sequence of neighbourhoods of 7 
such that n^i — 7- Since {7i}ieN converges to 7 in the topology 
of curves, we obtain a subsequence of curves {7i,.}r€N with 7^^. e Ui for 
each r G N. Lemma 8.2.4 implies that this subsequence has a causal clus- 
ter curve A. Since A lies in the intersection of all Ui and this intersection 
equals the set traversed by 7, we obtain A C 7. The other inclusion 7 C A 
follows since A is inextensible and 7 does not have self-intersections. I 

In view of Corollary 8.2.1 we can define the length 1^(7) of a causal curve 
7 simply by setting 
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^( 7 ) = j V- (tW.7 (*)) di. (8.2.1) 

Since in any neighbourhood U of any causal curve 7 C C^a^g3j(a;, y) 
there is a broken null geodesic from rr to y, the length functional can- 
not be lower semi-continuous with respect to the topology of curves. 
If (M,y) is not chronological, then the length functional is not upper 
semi-continuous either. To see this consider a timelike curve 7 which 
has self intersections. Then in any neighbourhood of 7 there are time- 
like curves which contain a closed segment which can be repeatedly 
traversed. In particular, we see that i = 00 for all such curves while 
7 has finite length. On the other hand, if (M, g) is causal, we have the 
following proposition. 

Proposition 8.2.3. Let x^y € M and assume that M is causal The 
length functional L is upper semi-continuous in C^causai(^?2/) respect 
to the topology of causal curves. 

For the proof of Proposition 8.2.3 we will need the following result. 

Lemma 8.2.5. Let (M,y) be causal and 7 € with ^(7,7) — 

— 1. Then there is a neighbourhood U of ^ in M and an extension of the 
curve parameter t of ^ to a function t: U R such that dt is timelike 
and grad(t)|^ = -7. 

Proof. Let W be a neighbourhood of 7 in M which is the union of a 
finite number of convex sets with compact closure. Let 7 be a time- 
like extension of 7 which has no endpoint in U (cf. Corollary 8.0.1). 
Since (M,y) is causal 7 cannot have self intersections. Hence we can 
choose U to be simply connected. Let {Eo(t), . . . , -En-i(t)} be an or- 
thonormal frame along 7 with £^0=7. The map /: V C M”' — ^ W, 
1-^ exp.^(i) is a local diffeomorphism 

near 7 since exp is a local diffeomorphism near the origin. Hence we 
can choose U and V so small that / is a diffeomorphism. This and 
the fact that U is simply connected (otherwise t may be a cyclic co- 
ordinate and only defined modulo an additive constant) implies that 
we can extend the function t to all of U. Since dt is timelike at 7, by 
choosing U small enough, dt is timelike everywhere in U. Finally ob- 
serve that dt{Ei) = dt(^ exp.^(t) = 0- Hence 

grad(t) is orthogonal to Ei and therefore parallel to Eq — 70 From 
1 - dt(7o) = y(grad(t),7) we obtain grad(t)|,y = -70. I 

Proof of Proposition 8.2.3. Since C^causai(^» v) ^ need only 

show that the length functional is upper semi-continuous in C^^^^(x,y). 
Let 7: [a, 6] M be a curve in Ct^^^(a:,y) and U be the neighbour- 
hood given by Lemma 8.2.5. Let /a be any timelike curve which con- 
nects X and y and is contained in U. Since dt{fi{t)) = 1 we obtain 
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A(^) = g(grad(t fi raa(t)) +'^(^)) where v{t) is orthogonal to grad(t). 

This implies 

-«,(.)(/!,« = - S„(.)(gtad((),pad(0) 

- ( p(gr,d(.)‘.padW) ) 

1 

” ^(grad(t),grad(t))' 

On the other hand, we have 1 = —9j{t){'yn) = ~ 5 ^ 7 (t)(gi’ 2 id(t),grad(t)). 
Since x »-> gfa,(grad(t),grad(t)) is continuous we may choose for a given 
e > 0 the set U so small that ~1 - € < ^(grad(t), grad(t)) < -1 + e. This 
implies < i~ = ~^^9^{t){dudt) and therefore L{ji) < 

Since ji G was arbitrary the length function L is 

upper semi-continuous on this set. I 

Remark 8,2J. Hawking and Ellis (1973) give a slightly different defini- 
tion for the length of a causal curve. They first define the length of a 
timelike curve 7 G by Equation 8,2,1. For a causal curve (j, 

they set 

L()U) = inf{f(W) : 7/ is a neighbourhood of ^}, 

where 

i(U) = sup{i( 7 ) : 7 C W is timelike and 

It follows that this length functional L is also upper semi-continuous in 
^causal respect to the topology of causal curves. Hence in 
causal spacetimes, L and L coincide. However, if (M, g) is not chronolog- 
ical, L does not coincide with the original (and more intuitive) definition 
since for any timelike curve 7 with self-intersection one has L{j) = 00 . 



8.3 Achronal submanifolds 
and Cauchy developments 

Given a set A the set I'^{A) consists of those events which can be influ- 
enced by It is clear that its boundary dI~^{A) is of special importance. 
In this section we show that dI~^{A) must be a Lipschitz manifold. We 
will also investigate the “domain of dependence” of A, i.e., the set of 
those events which are completely determined by the physical data of A, 
For the following, definition, standard examples are spacelike sub- 
manifolds and dI'^{B) where B is any subset. 
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Definition 8.3.1. A subset A C M is called achronal if there is no 
timelike curve which intersects it twice, A set F is called a future set if 
I^{F) C F, Past sets are defined analogously. 

Lemma 8.3.1. Let F be a future set. Then dF is an achronal Lipschitz 
hypersurface. 

Proof Let x e dF. Since /^(x) C F is open, dF must be achronal. It also 
follows that I~~{x) C M\F, Consider a convex coordinate neighbourhood 
ZY of rr as given by Lemma 8.2.1. The integral curves of d^o 

through (0,x^, , . . are timelike. Since the integral curve of d^o 

through X intersects both I'~ {x, C) and I~^{x^ C) there is a neighbourhood 

V C W of X such that all integral curves of d^o which intersect 

V intersect J“(x,C) and I'^{x^C). It follows that each of these curves 

intersects dF in a unique point Since dF 

is achronal, there is a constant k > 0 such that 



|ar° o j/(a;S . . . , a;” ^) - x° oT/(f^, . . . ^)| < 



n— 1 



^ (a;* - x *)2 



for all {x^,,. . ,x^ ^), (x^, . . . , x^ ^). Hence the function 
is Lipschitz. 



Definition 8.3.2. The nulL boundary of a future set F is the set 
gnuiiF — {a; G dF : 3 a neighbourhood U of x with I'^{F\U) = I~^{F)}. 
The acausal boundary of F is d^^F — dF \ d^^^^F. 




Fig. 8.3.1, The definition of null boundary and achronal boundary 



This definition is justified by the following lemma. 

Lemma 8.3.2. Let F be a future set and x € Then there is an 

achronal null geodesic generator through x with future endpoint x. This 
generator is past inextensible or has past endpoint in d^^F. 
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Proof. Let W be a neighbourhood of x such that J+(F) = I~^{F \ U). 
There is a sequence of points C I'^{F) with Xi x. Let 7^ be a 

causal curve from some yi € {F) \U to and let 7 be a causal cluster 

curve with future endpoint x. This curve cannot intersect I'^{F) since 
then we would have x € I~^{F). It follows that 7 lies in the achronal set 
dF and is therefore an achronal null geodesic. If 7 has a past endpoint 
G we can repeat the argument above to obtain a cluster curve jj, 

which has future endpoint 2; and which is also an achronal null geodesic. 
Since the concatenation of fi and 7 must be achronal j they are both part 
of the same null geodesic. It follows that this null geodesic is either past 
inextensible or has past endpoint in d^^^F. I 



Definition 8 . 3 . 3 . Let A he an achronal set Then the edge of A, edge(^) 
is the set of all points x € A such that for any neighbourhood U of x each 
pair of points in I^{x^U) can be joined by a timelike curve which is 
contained in U and does not intersect A, 

Corollary 8 . 3 . 1 . Let A be a set. Then dJ^{A) is an achronal Lipschitz 
hypersurface. The set dJ^ (A) \ A is generated by null geodesics without 
conjugate points. These generators are past inextensible or have past 
endpoint in edge(^). 

Proof. The first property is clear since J'^(A) is a future set. Let x G 
dJ~^{A)\A and 7 be the maximally past extended null geodesic generator 
with future endpoint x. Let y be the past endpoint of 7. Since 7 is a 
generator of 5 {A) \ A there is a neighbourhood W of 7 \ {y} which 
does not intersect A. 

Assume that y G A\edge(A). Then there is a neighbourhood convex 
U oi y such that for every pair of points in I^(y,U) every causal 
curve A which connects z” and z~^ intersects A. Let G I~{x,U)^ 

G ^U) n I~{z^,U) n 7 n W, and Ai a timelike curve from 

to z^. There is a point G I~^{y^U) n W and a timelike curve A2 C 
W from2:^ to The concatenation A of Ai and A2 intersects A since 
y e A \ edge(A). By the construction of W it is clear that A2 cannot 
intersect A. Hence there is a point z G A2 n A. Consequently, we obtain 
X G J~^{zq) C C in contradiction to x G dJ^{A). This 

proves y ^ A\ edge(A). 

Assurne now that y is past endpoint of the null geodesic generator 7 
and is not contained in A. Then y has a neighbourhood U which does 
not intersect A. This implies that for each z G I~^{A) O U there is a 
timelike curve from A to z which initially does not lie in U. Hence we 
have shown I~^{A) = /*^(J‘^(A)) = J’^(/'^(A) \ W), i.e., y G J'^(A). 
This is impossible by Lemma 8 . 3 . 2 . I 
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Lemma 8.3.3. If A is a spacelike hypersurface, edge(^) is a subset of 
the boundary of A. 

Proof Let x € A\ dA, Since A is spacelike there exists a neighbourhood 
U of X such that (x,U)\Jl~ {x,U))r)A = 0. Since ^ is a hypersurface 
it divides U (if chosen small enough) into two disconnected components, 
one of them containing and the other I~{x,U), But this implies 

that every causal curve from I~{x,U) to I'^iU) must intersect A, I 

Another important set is the future horismos of A C M. It consists of 
those points which can be reached from A via causal but not via timelike 
curves. 

Definition 8.3.4. The future horismos of a subset A C M relative to 
a neighbourhood U of A is the set E'^{A,U) = J’^(A, W) \ I‘^{A,U), 

Lemma 8.3.4. Let A C M be a spacelike submanifold andx € M, Then 
X e E'^{A,U) holds if and only if there is a null geodesic from A to x 
which is completely contained in U and does not have focal points before 

X. 

Proof, This follows immediately from Corollary 8.3.1, Theorem 4.6.1, 
Lemma 4.6.15, and the fact that jB+(A,W) is a subset of SJ’^(A,W). I 

If A is a set we are also interested in those points which are completely 
determined by the physical data at A. If Postulate 8.0.1 holds then this 
set of these points is described by the following definition. 

Definition 8.3.5. Let A be a set Then the future Cauchy development 
(A) is the set of all points x e M such that all past inextensible causal 
curves through x intersect A. The past Cauchy development is defined 
analogously and denoted by D~{A), The union D{A) = D'*‘(A)UD"(A) 
is called the Cauchy development of A, 

Lemma 8.3.5. For any achronal set A we have /~(D+.(A)) n/+(A) c 
D+(A). 

Proof, If X e /“(£>+ (A)) n /+(A) then there is a y € D'^{A) n 
Let 7 be a timelike curve from x to y. If x ^ D~^{A) then there is a past 
inextensible curve /i with future endpoint x which does not intersect A. 
Since the concatenation of /i and 7 is a past inextensible curve with future 
end point y the curve 7 intersects A at some point .2;. From € I^{x) and 
x 6 /"^(A) we obtain z G /"^(A) which implies that there is a timelike 
curve with future endpoint y G I^{z) which intersects A twice. This 
gives a contradiction to the achronality of A. I 

The following theorem shows that global hyperbolicity (Definition 8.1.3) 
and Cauchy developments are strongly linked. 
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Theorem 8.3.1. Let A be an achronal set Then int{D{A)) is globally 
hyperbolic or empty. 

First we need to establish the following two lemmas. 

Lemma 8.3.6. Any past inextensible causal curve which passes through 
X e mt{D'^{A)) intersects I~(A), 

Proof. Let 7 be a past inextensible causal curve with future end point 
Xo — X. The set J’^(7)n^ is empty unless 7 intersects I~(A). We choose 
a Riemannian metric h on M and denote for ^ C M the neighbour- 
hood {z e M : disth{z,z) < e} by Be{x). Let {xi} C 7 , Xi+i € J~{xi) 
be a sequence without past accumulation point and let yo e I^{xo) H 
int(D’^(i4)). There is a point yi G I~{yo) H I~^{xi^Bi{xi)). We induc- 
tively define a sequence with yi e I~{yi-i) n I^{xi,Bi{xi)) 

and connect the yi by timelike curve segments. The concatenation of 
these curve segments is an inextensible timelike curve jj> C /"*"(7). Since 
yo e int{D'^{A)), the timelike curve /i must intersect A at some point 
y. There is an yj e fj,n I~{y) which implies xj € This gives a 

contradiction to /'^(7) n ^ = 0. I 

The spacetime depicted in Fig. 8.1.4 violates strong causality along an 
achronal null geodesic. The following lemma shows that for chronological 
spacetimes the set where strong causality is violated is always generated 
by null geodesics. 

Lemma 8.3.7. Assume that (M^g) is chronological. If it fails to be 
strongly causal at x 6 M, then there is an achronal, inextensible null 
geodesic fi through x along which strong causality is violated. 

The end piece of /i to the past ( resp. future ) of x is a cluster curve 
of curves which have intersected (resp. will intersect) arbitrarily small 
neighbourhoods of x before (resp. afterwards) 

Proof. Let Ui be a basis of convex neighbourhoods of x and {fjii}ieN be 
a sequence of inextensible timelike curves such that f^i intersects Ui at 
least twice. Let Xi,yi e Ui n (Xi, yi 6 J'^{xi) be points such that the 
segment of pa between Xi and yi leaves Ui. We denote by yi] (resp. 

— ^]) the past (resp. future) inextensible endpiece of fii with future 
(resp, past) endpoint yi (resp. Xi). 

Let pA[x — ^] be a future inextensible causal cluster curve of {lii{xi — ^ 
]}^eN with past endpoint x and [pii.[xi. — ^]}j€N be a a distinguishing 
subsequence. If there is a point z e pA[x I^{x) then there are 
neighbourhoods Vx ,Vz oix and ^ such that 

Vz C I'^{x) for all X eVx and Vx C I~{z) for all z G V^. 

Let jo € N be a number such that yi,. G Vx for all j > jo. By the 
definition of a cluster curve there is a ji > jo with [xi.^ -^] n V^ ^ 0. 
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Let z e n V^. We obtain a closed timelike curve by first 

traversing [xy^ — ^] from z to yi.^ and then connecting yi.^ e with 

z e Vz through a timelike curve. Since this contradicts the chronology 
of (M,p) we have shown that jj.~^[x ->] fl I'^{x) = 0 and therefore that 
the cluster curve fx'^[x ~»] is achronal. 

Let x] be a future inextensible causal cluster curve of > 

with future endpoint x. As for fi'^[x — ^] we see that x] is 

achronal. 

The concatenation n of x] and ix^[x if future and past 

inextensible. If it is not achronal there exist points G yT [-^ or] and 

€ y^[x — >] with Zj^ 6 and therefore also neighbourhoods Vz_ 

and Vz^ of and such that all £_ € calVz_ and all 5+ 6 calVz^ 
are chronologically related. Let {yi.j[~~> yiJjjeN be a subsequence of 
2 /i]}i€N which distinguishes x]. Hence there is a jo € N 

such that yy[—^ Vij] intersects Vz_ for all j > jo* Consider now the 
subsequence ^ recall that the curves y^] and 

fjiy[xy —»] coincide. We can assume (without loss of generality) that 
the sequence “^]}i>jo distinguishes y'^[x — ^]. Hence there is a 

ji > jo a-nd a curve — >] which intersects Since y~^[x “^] 

is a cluster curve with past endpoint x = lima;^. this neighbourhood 
is intersected before [xy^ = /j,y^ [“4 yy^ ] intersects • Again 
we get a contradiction to the chronology of {Myg). This proves the first 
assertion of the lemma. 

The second assertion follows directly from our construction. I 

Proof of Theorem 8,3.1, We show first that int(D(A)) is strongly causal. 
By the definition of Cauchy development there cannot be any closed 
causal curve in int(D(A)). Assume that the strong causality condition 
is violated at a; € int(D(A)) and let U be an arbitrarily small convex 
neighbourhood of x in int(D(A)). Then there exists a sequence of future 
directed timelike curves 7 i which have x as an accumulation point and 
intersect U twice. There is an inextensible cluster curve 7 through x. 
This curve must intersect A since x € int(jD(A)). By Lemma 8.3.6, 7 
intersects both I~{A) and /"^(A). Since these sets are open they are 
both intersected by 7 ^ for i large enough. From Lemma 8.3,7 we see 
that 7 i first intersects I^{A) and then I~{A). Hence we can prolong 
these curves such that they intersect A more than once. This gives a 
contradiction to the achronality of A. 

We show now that for any x^y £ int(D(A)) the set J^{x) n J~{y) 
is compact. Since there are no closed causal curves in int(D(A)) the 
equation x — y implies J^{x) Pi J~{y) — {x} which is compact. Hence 
we can assume that y £ \ {]/}• We may also assume that y £ 

int(D+(A)) since otherwise x £ int(D“ (A)) and we could apply the time 
reversed argument. Let be a sequence in J‘^{x)r\J~{y). We need 

to show that it contains a convergent subsequence. For each Zi there is 
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a causal curve 7^ from x through Zi to y. If a subsequence of 

lies in D'^{A) U A, we consider the cluster curve with future 
end point y. Otherwise a subsequence of lies in D~{A) 

and we can consider the cluster curve with past endpoint x. Again by 
time-reversal, we can restrict to the first possibility. Since y e D‘*'(A), 
the cluster curve must intersect A. Hence the segment from A to ^ is 
compact and has a compact neighbourhood U. Since this segment is a 
cluster curve, infinitely many of the Zi . lie in U and must therefore have 
a convergent subsequence. I 



Lemma 8.3.8. Let A be a set Then 



D+(A) ~ {x E M : every past inextensible timelike curve 

with future endpoint x intersects A} 

Proof, “c”:. Let x E D'^{A) \ A and 7 be a past inextensible timelike 
curve with future endpoint x. Let W be a convex neighbourhood of x 
which does not intersect A and y E 7 fi ZY be a point different from 
X, Then I^{y^U) is a neighbourhood of x and must therefore intersect 
Let /X be a timelike curve from y to some z G D'^{A) E\U which 
is contained in U. Since it does not intersect A and the concatenation of 
^ and 7 is past inextensible, 7 must intersect A, 

“D:” let X ^ a be a point such that every past inextensible timelike 
curve with future endpoint x intersects A. Now let 7 be a timelike curve 
with future endpoint x and {xi}i^M be a sequence in 7 which converges 
to X. Let be a causal curve which is past inextensible and has fu- 
ture endpoint Xi. Since any causal curve which is not an achronal null 
geodesic can be perturbed slightly such that the resulting curve is time- 
like, the concatenation of Xi and the future end piece ^[xi ~^\ of 7 must 
intersect A. Hence for i sufficiently large, must intersect A. Since Aj 
was arbitrary it follows that Xi E D^{A) and, consequently, x E D’^(A). 



Definition 8.3.6. The future boundary o/D'^(A), 

F+(A) = D^Aj \ /-(D+(A)) 

is called the future Cauchy horizon. The past Cauchy horizon H~ (A) is 
defined analogously. 

The future Cauchy horizon marks the limit of the set which can be 
predicted from knowing data at A. It has the following fundamental 
properties. 

Lemma 8.3.9. Let x E H^{A) \ A, Then x E /“^(A). 




8.3 Achronai submanifolds and Cauchy developments 381 



Proof. There is a neighbourhood U of x which does not intersect A. 
Since for any y 6 I~{x,U) the set I'^{y^U) is a neighbourhood of x it 
must intersect Let z C I'^{y^U) Pi D'^{A). It follows that y G 

D'^(A) C I'^{A) since otherwise we could construct a past inextensible 
curve through .s; which which does not intersect A. I 




Fig* 8*3.2* The Cauchy horizon for a set which fails to be achronai 



Proposition 8.3.1. Let A be a dosed achronai set Its future horizon 
H~^{A) is generated by achronai null geodesics which have either no past 
endpoint or past endpoint in edge{A) . 

Proof. The set P — D"^(A) U I~(A) is a past set since I~(D~^{A)) C 
D^{A)UI-{A). Let y G H^{A) = D^{A) \I~{D+{A)). If y G I’{A) 
then there exists a timelike curve which intersects A twice in contradic- 
tion to the achronality of A. Hence H~^{A) is a subset of the boundary 
of P and therefore a subset of an achronai Lipschitz manifold. 

Let x G (A)\edge{A) and let {xi}i^n be a sequence in I'^{x) which 
converges to x. By the definition of H~^{A) none of the Xi is contained 
in D~^{A). Hence for each Xi there is a past inextensible causal curve ji 
which does not intersect A. Let 7 be a causal cluster curve with future 
endpoint x. We will show that x has a convex neighbourhood C such 
that 7 HC C H~^{A). 

Since Lemma 8.3.9 implies that x G H‘^{A) C Au I'^{A) there are 
two possible cases, x G A\ edge(A) or x G I~^{A). If a; € ^ \ edge(A) we 
choose C such that every timelike curve from I~{x,C) to I'^{x,C) which 
is contained in C intersects A. If x G I'^{A) we choose C C I'^iA). 

It follows that 7 HC C J^(A) U A. This is clear if x G I'^{A). Assume 
therefore that x G A and that there isany€ 7 nC\A. Since A is closed 
there is a convex neighbourhood U <ZC of y which does not intersect A. 
This neighbourhood intersect 7 ^ (for i large enough). It also intersects 
I~{x) since x G A and y G J~{A). Hence there is a timelike curve A in 
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U from /“ (x) to . Since 7^ does not intersect A we can concatenate A 
with a the future endpiece of 7i to obtain a timelike curve from I~{x) to 
Xi G which is contained in C and does not intersect A, This gives a 

contradiction to x € i 4 \edge(A) and we have proved 7PiC C I‘^{A)\JA. 

Assume that there is an ^ € (//nC)\D+(A). If the segment {')C\C)[y -4 
x] of 7 n C between y and x would intersect A then we woul d obtain a 
contradiction to the achronality of A from y G (I'^{A) U A) \ D+(A) c 
I’^(A) and (7 n C)\y -4 x] € J^{y )^ Let W c C be a neighbourhood of 
y which does not intersect D+(A) and consider a point 2 ; G (7'"(y,W) O 
I~^{A)) \ D'^{A). Then the concatenation A of a timelike curve from 2; 
to y in W and the segement (7 0 C)[a: ^ y] does not intersect A. Since 
X G JT’^(A) and A is closed a slight deformation of A results in a causal 
curve from to some point in x G D~^{A) which does not inte rsect A. 
We could now prolong this curve to the past of z e M \ D+ (A) thereby 
obtaining a past inextensible curve which does not intersect A. T his gives 
a contradiction to x G D"^(A), whence we have proved {'jnC) C D+(A). 

Assume that there is an y G (7nC)n/“(D+(A)) and let ^ G I~^{y)r\ 
D'^{A). Let A be a timelike curve from y to >2;. This curve cannot intersect 
A to the future of y because of y G I'^(A) U A and the achronality of A. 
If y ^ A then there is a neighbourhood U of y with U C I~{z) which 
does not intersect A. Since 7 is a cluster curve of there is an i 

and a point y e ji HU, It follows that there is a causal curve A from 
y G ji to z which does not intersect A. If y G A then x e A \ edge(A) 
by the construction of C. The point y has a convex neighbourhood of 
U C C r\ I~{z) which is intersected by infinitely many 7^. Let i G N 
with 'jiHU ^ 0, y G 7 i n W, and A be a timelike curve from y to js;. 
Consider a timelike curve ^ from I~{x)f)C to y which is contained in C, 
The concatenation of y, and the part of 7^ to the future of y intersects A 
because of x ^ edge (A) and Xi G I'^(x). The equation 7 ^ Pi A = 0 implies 
that y intersects A. Hence A cannot intersect A by the achronality of A. 
We have shown that in either case, y ^ A and y G A, the concatenation of 
the past endpiece of 7^ with future endpoint y and A is past inextensible 
and does not intersect A. This gives a co ntradic tion to z G D~^{A). 
Consequently, we ha ve show n (7 nZ^) fl / ~(.D+(A)) = 0 

We have 7 fl C C D+(A) \ I” (D+(A)) = {A). The past endpoint 

X of 7 n C lies in i 7 "^(A) since H~^{A) is closed. If x ^ edge(A) we 
can repeat the construction thereby obtaining a curve 7 n C C H'^{A) 
with future endpoint x. The concatenation of 7 O C and 7 Pi C gives a 
curve 7i C i 7 '^(A) with future endpoint x. Repeating this construction 
inductively we obtain a causal curve joo C 7 /“^ (A) with future endpoint 
X which has either no past endpoint or has past endpoint in edge(A). 
Since H~^{A) is achronal this curve must be an achronal null geodesic. 
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In this chapter we prove and investigate “singularity theorems”. These 
theorems are usually interpreted as an indication that black holes exist 
and that there has been a big bang — or at least that there are regions 
in spacetime where general relativity breaks down. They are one of the 
main motivations for attempting to quantise general relativity. While 
there is a lot of evidence in favour of this interpretation we will see 
that there are also open problems which have to be addressed in order 
to justify this interpretation. 

In Chaps. 7 and 6 we have seen that spacetimes describing a single, 
non-rotating star and the simplest cosmological models of our universe 
contain regions where the curvature diverges. One may think that these 
singularities are only an artifact of our high symmetry assumptions, but 
in this section, we will give an indication that a physically realistic space- 
time must contain such singularities. More precisely, we will show that 
there exist causal, inextensible geodesics which are incomplete. Recall 
that a freely falling particle is represented by a timelike geodesic. If the 
geodesic cannot be extended to a complete one (i.e. if its future end- 
less continuation or its past endless continuation is of finite length), 
then either the particle suddenly ceases to exist or the particle suddenly 
springs into existence^. In either case this can only happen if spacetime 
admits a “singularity” at the end (or beginning) of the history of the 
particle. This singularity may be a curvature singularity, there may be a 
topological obstruction, or spacetime may simply cease to be sufficiently 
smooth. However, the Schwarzschild and Robertson Walker solutions in- 
dicate that these singularities are accompanied with diverging curvature. 
(But cf. Sect. 9.5.1 below where we present a spacetime which indicates 
that such singularities are very mild) . We will prove a singularity theo- 
rem which only establishes the existence of incomplete causal geodesics 
rather than incomplete timelike geodesics. While the innocent looking 
extension to null geodesics is necessary for the proof, the name singu- 
larity theorem is in this case somewhat misleading, because there exist 

^ This should not be confused with pair creation or pair annihilation of parti- 
cles and anti particles, because during these processes nothing really ceases 
or starts to exist. These quantum mechanical phenomenons are merely 
changes of state. 
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perfectly regular spacetimes with incomplete null geodesics contained in 
compact subsets. On the other hand, it has been argued that such exam- 
ples are very special and that in stable, physically realistic spacetimes 
this phenomenon does not occur (cf. (Hawking and Ellis 1973)). 



9.1 Energy conditions 

In general, a maximally extended Lorentzian manifold need not contain 
incomplete causal geodesics. In order to prove a singularity theorem, we 
will have to make some physical assumptions. 

There are two sorts of fundamental physical experience which come 
to mind. Firstly, energy density as measured by the energy momentum 
tensor is positive. Secondly, gravitation is attractive. 

Recall that the energy density measured by an observer 7 (with 
= — 1) is given by e = T( 7 , 7 ). We feel that this energy den- 
sity should be positive. Recall also that in the motivation of the energy 
momentum tensor (cf. Sect. 5.1) we have obtained the energy density 
€ ~ T{U^U) as an average of a positive mass distribution. For our pur- 
pose this should be enough of a motivation of the following definition 

Definition 9.1.1. We say that the weak energy condition holds at x E 
M if 

T(u, u) > 0 for all causal vectors u € TxM, 

For a physical verification of the weak energy condition one would have 
to consider all realistic physical matter models. This is beyond the scope 
of this book but so far the available evidence points to the fact that the 
weak energy condition does hold. 

Gravity is attractive if and only if any two nearby freely falling ob- 
servers will be forced to approach each other under the influence of the 
underlying spacetime geometry. This can be formulated infinitesimally 
in a rigorous manner, A freely falling observer is modelled by a timelike 
geodesic 7 : [a, 6 ] — ^ M. Let /: (—5,5) x [a, 5] M a geodesic varia- 
tion of 7 and J — /s( 0 , •) be the variation vector field. Observe that 
J is a Jacobi vector field. From Taylor’s theorem we get with respect 
to any coordinate system P{Syt) = p{0^t) -f sJ^(t) -h 0{s^). This co- 
ordinate expression can be interpreted in Newtonian terms as follows. 
The observers 7 and /( 5 , ■) have (up to first order) the same rest space 
and are separated by the space vector sJ^. Hence up to first order it 
makes sense to speak of the (Newtonian) force F with which 7 acts on 
/( 5 , •). This force is approximately given by = ~msJ^ where m is 
the mass of the observer /(s, •). (The minus sign is inserted because the 
force vector points from f{s,t) to 7 (t) and J points into the opposite 
direction). Clearly there cannot be a direct translation of Newtonian 
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concepts to general relativity. But for small relative velocities (as in this 
case, (Ms,t)y 7^(t) for s 1) there is a well defined infinitesimal 
limit. In fact, the location of an infinitesimally nearby observer is char- 
acterised by the Jacobi vector field J orthogonal to 7 and the force it is 
acted on is given by F = Since J is a Jacobi vector field, 

this is exactly the force in the following definition. 

Definition 9 . 1 . 2 . Let j be a timelike geodesic with (7)7) ” “ 1 * ^ 
neighbouring freely falling observer J of mass m is a pair ( J, m), where 
J is a Jacobi field along 7 with values in 7-^ and m is a positive number. 

The tidal force which acts between the observer 7 of mass m and its 
neighbouring freely falling observer J is given by 

F = mF(J,7)7. 



The component of F pointing towards the observer 7 can is given by 



F,- 




m 

7W) 






Hence the assertion that 7 attracts the infinitesimally neighbouring ob- 
server corresponding to the Jacobi field J is equivalent to the assertion 
that the sectional curvature of the plane spanned by J and 7 is non- 
positive.^ This motivates the following definition. 

Definition 9 . 1 . 3 . We say that at x e M gravity is attractive in all 
directions if and only the sectional curvature of all timelike planes in 
TxM is non-positive. 

The requirement that gravity is attractive in all directions is very strong. 
Also note that apart from our experience in weak gravitational fields we 
have not much evidence that gravity is really attractive in all directions. 
On the other hand, it is clear that gravity must be attractive on average. 
The reason is that gravity is much weaker than all the other fundamen- 
tal physical interactions. Electromagnetism and gravitation are the only 
long range interactions. On large scales, electromagnetism is not of pri- 
mary importance because it is attractive or repulsive depending on the 
configuration. If this would also be true for gravity, it would be of even 
less importance for astrophysical applications. We know however that 
this is not the case. There are of course several ways of defining averages 
of gravitation. We will restrict attention to our single observer 7 together 
with the associated space of neighbouring, freely falling observers. The 
average of the tidal forces in every direction is given by integrating the 



^ Observer that we neglect the contribution of 7 and the neighbouring ob- 
server to the gravitational field. They are thought to be of negligible mass. 
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tidal force component in every direction over the unit sphere For 

n = 4 we obtain 



3m 
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= -^i?(ei,7)7,Ci = -Ric(7,7) 
i—1 



This motivates the definition 



Definition 9.1.4. The timelike convergence condition holds if 

Ric(u, u) >0 for all causal vectors u. 

Using Einstein’s equation this condition can be re-expressed in terms of 
the energy momentum tensor: 

T{u^u) — ^ 0 all causal vectors u. 

While this inequality is not implied by the weak energy condition (nor 
does imply it), the timelike convergence condition is often also called the 
strong energy condition. The following lemma gives a partial motivation 
for this terminology. 

Lemma 9.1.1. Assume that g andT can be simultaneously diagonalised 
so that the energy density e and the principal pressures pi of T are de- 
fined. Then the weak energy condition is equivalent to 



e > 0, 



e4-Pi > 0 for € {1, . . . , n - 1}, 
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and the timelike convergence condition is equivalent to 

n-l ^ 

(n - 3)e + Pi - -7- ^ c + Pi>0 for alii e {1, . . . , n - 1}. 

47T 

t=l 

Proof. Let {eo, . . . , en~i} be an orthonormal basis of TxM which diago- 
nalises T. Any causal vector u can be written as u = c(eo -f 
where numbers with < L Hence the weak energy condi- 

tion is equivalent to 

n— 1 n— 1 n— 1 

0 < T(eo + c*6i, eo + c*ej) = e + '^{c’‘)‘^pi 






The condition for the weak energy condition is sufficient since the factors 



andO 



are positive. It is necessary since we can choose the numbers = 0 or 
for any given j the numbers d — 5j, 

The strong energy condition is equivalent to 



0 < (n “ 2)Ric(eo + ^ 4- ^ c^e^) 

i=l 

n— 1 n— 1 

= (n - 2)T(eo + E + E 

i— 1 i~l 

y A \ n~l n— 1 

- ( tr(T) - T- ) 9{eo + E + E 



- (n - 2 ) ( e + I + E^'^' “ i;: 



(n-2)y^(c’f (e + pi) 



Hence the assertion for the timelike convergence condition follows by the 
same argument. I 
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It seems physically more plausible to demand that the matter distri- 
bution of our universe satisfies the inequality e > 0 rather than the 
inequality e + Pi ~ > 0 . In this sense the timelike convergence 

condition is “physically” a stronger condition on the matter distribu- 
tion than the weak energy condition. Since the cosmological constant 
is close to zero and the energy density is much larger than the princi- 
pal pressures^ the timelike convergence condition is still a very plausible 
assumption to make. 

A further condition we want to impose is the genericity condition. It 
states that along each causal geodesic 7 there exists a point at which 

Yl"^l[aRh]cd[elf] ^ 0 

holds. This condition is only of technical nature^ because the set of met- 
rics which satisfy the strong energy condition is dense in the set of metrics 
which satisfy both the strong energy and the genericity condition. Here 
we impose the (very fine) C^-Whitney topology (for details and proofs 
see (Lerner 1973 )), 

The mathematical significance of the strong energy condition in con- 
junction with the genericity condition is clear from the following corollary 
to Proposition 4 . 6 . 3 . 

Corollary 9 . 1 . 1 . Assume that (M,^) is a Lorentzian manifold and let 
7: (a, 6) -4 M be an inextensible causal geodesic. If Ric{j{t)^'y(t)) > 0 
for all t and the genericity condition holds along 7 then either 

(i) 7 is incomplete, or 

(a) 7 contains a pair of conjugate points. 

Proof We have to show that the genericity condition implies the existence 
of a ^0 such that the map 

R- (7(^0))^ Rv- R{v,'y)'j 

is not identically zero. 

If 7 is timelike and Y{t)Y{i)%{t)Rb]cd[e 7 f]{l^) ^ 0 then we have 
in particular 7^(t)7^(t)i^6cde ^ 0 . Since a symmetric bilinear form is 
determined by its associated quadratic form there is a vector ^ G (7(t))"^ 
with 5^(i?(67(t))^,7(t)) 0 . Consequently, P{i,i{t))^{t) ^ 0 . 

Suppose that 7 is null and that 7^(^)7^(^7[a(^)^6]crf[e7/](f) ^ 0. We 
choose a basis of Ty(*)M such that = 7(^0) and (e^, e\f) = 

- 0, {er,es) Srs ~ I for j,k € - 2}, r,s e 

{n — l,n — 2}. In this and the associated dual basis we have 7^ = 5 ^ and 
7a = This implies 

^ Recall that in physical units where the velocity of light c is not normalised 
to 1 the numerical value of the energy density increases by a factor (? in 
comparison to the principal pressures. 
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mt)i‘'it)%mb]cd[ejf]{t) = %-^Rb]nn[eS]]-^ 



rn— Ip 7? 

— <^a -^bnne^f ~~ ^anne^J f 

+ Sr^RannfSr^ ~ 8^^ R^nufS^-^ ■ 



If this expression does not vanish then either a or 6 must be equal to n— 1 
and so must be either e or /. For definitenesSj assume that 6 = / = n~ 1. 
The formula then simplifies to 






Ft(n~ 



•{n~l)nne 



Ra 



+ Rann(n-l)^e ^ ~ ^ R{n-l)nn{n-l)^e 



This clearly vanishes if n e {a, e}. The first two terms and the last two 
terms cancel pairwise ifa = n- l. Ife = n-1 the second summand 
cancels the third one and the first summand the fourth summand. Hence 
Yi'^)'y^{'^)%i'^)Rb]cd[eJf]{t^ ^ 0 implies that a,e e n - 2} and 

Ranne ^ 0- But this implies in turn that ^ (7(0)“^ 

does not vanish. I 



9.2 Closed trapped surfaces 

As a further preparation, consider an isolated, dense object, say a star, 
in spacetime. If it produces enough gravitation, it will not only attract 
material objects but even the light rays it sends out. (Recall that they 
are modelled by null geodesics and therefore perceptible to the curva- 
ture of spacetime). Exactly this situation happens in the Schwarzschild 
spacetime. To be more concrete, let T be stn (n — 2)~dimensional space- 
like submanifold of M. We may think of T as the surface of the star 
at a fixed time. We can now send out light orthogonal to this surface, 
either in direction to the centre of the star or into the opposite direction. 
Everyday experience suggests that the light congruence directed to the 
centre should converge while the light congruence directed into the op- 
posite direction should diverge. However, this does not take into account 
the extrinsic curvature of the spacelike hypersurface which represents 
the instant of time. There are many examples where both commences 
converge, for instance the surfaces r ~ const < 2m, t = const in the 
Schwarzschild solution. These surfaces are in the black hole region of the 
Schwarzschild solution and the general interpretation is that the gravi- 
tation of the black hole is so strong that it forces even initially outgoing 
light rays to converge. 

Since the normal bundle of T in M is a Lorentzian plane at each 
point, there exist two future directed null vector fields N^^N- along 
T which are orthogonal to T and satisfy (W_,iV+) = —1. They are 
unique up to transformations of the form N± where a G 

C^(T,R+\{0}). 
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Let X £T and be an orthonormal basis of TxT. Then 

the requirement that both light concruences immediately start to con- 
verge can be expressed by the inequalities 

n— 1 n— 1 

A=2 A=2 

This requirement can be formulated in a manifestly invariant way by 
using the mean curvature vector field H oiT (cf. Definition 4.4.2). 

Definition 9.2*1. A closed (future) trapped (n— 2)-surface (respectively 
closed (future) strictly trapped (n-2)-surface, closed (future) marginally 
trapped (n — 2)-surfaceJ is a closed (n — 2) -dimensional spacelike 
submanifold T such that the mean curvature vector field H ofT is past 
pointing and causal (respectively timelike, null). 

The mean curvature vector field is defined hy H = ^a)- 

Any vector v eTxM can be decomposed as u — + v~N- + 

where and v± = ~ (v,N^:), Using this decomposition , we 

obtain 

A=2 

= (trr(x-)A^+ + trr(x+)^-) , (9.2.2) 

where x± = are the null second fundamental forms. Like N±, these 
null second fundamental forms are uniquely defined up to transforma^ 
tions of the form x± ^ where a € C'^(T, \ {0}). The null 

expansions are 9^ = (x±)ab* T is a strictly closed 

trapped surface if and only if both null expansions are everywhere neg- 
ative on T and therefore equivalent to Inequalities (9.2.1). 



9.3 The singularity theorem of Hawking and Penrose 

We are now ready to state the main result of this chapter, 

Theorem 9.3.1. A spacetime {M,g) is not causal geodesically complete 

¥ 

(i) the strong energy condition and the genericity condition hold, 

(ii) The chronology conditions holds, 

(Hi) There exists at least one of the following: 

(a) a strictly closed trapped surface, 

(b) a compact achronal set without edge, 
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(c) a point x such that along every past (or every future) inexten- 
sible null geodesic from x the expansion of the null geodesics 
starting at x becomes negative. 

Strictly closed trapped surfaces are expected to surround very dense 
stars. The paradigmatic example is the Schwarzschild solution. Further 
evidence is provided by some theorems which prove the existence of 
strictly closed trapped surfaces if the concentration of matter is high 
(Schoen and Yau 1983; Bizon, Malec, and O’Murchadha 1988). Condition 
(b) is satisfied for spatially closed universes such as the Robertson Walker 
spacetimes with positive curvature. Condition (c) seems to be satisfied 
for our point in the universe (assuming a spacetime which differs only 
slightly from a Robertson Walker cosmology). This indicates that there 
was a big bang or that there will be a big crunch. (For more details 
cf. (Hawking and Ellis 1973, p. 358)). 

Theorem 9.3.1 will follow as a corollary of the following proposition. 

Proposition 9.3.1. The following three conditions cannot hold all to- 
gether. 

(i) every inextensible causal geodesic contains a pair of conjugate 
points, 

(a) {M,g) is strongly causal 

(Hi) there is an achronal set A such that E'^{A) or E^{A) is com- 
pact 

Proof that Theorem 9.3.1 follows from Proposition 9.3.1. Assume, that 
(M, g) is causally geodesicaJly complete and satisfies the chronology con- 
dition. By Corollary 9.1.1, the strong energy condition and the genericity 
condition imply that any inextensible causal geodesic has a pair of conju- 
gate points. In particular, there do not exist maximal, inextensible causal 
geodesics. It follows that {M,g) must be strongly causal, since otherwise 
it would contain an inextensible achronal null geodesic by Lemma 8.3.7. 

If {M,g) contains a strictly closed trapped surface T, then E^{T) C 
dJ'^{T) is generated by null geodesics. These null geodesics are orthog- 
onal to T and the definition of a strictly closed trapped surface implies 
that each of them has a focal point (cf. Proposition 4.6.2). Since T is 
compact and E*^(T) is generated by null geodesics without focal points 
it follows that E~^{T) is also be compact. An analogous argument shows 
that in case (c) the set E^{x) is compact. 

If {M,g) contains a compact achronal set A without edge, then 
E'^(A) — A. This follows since E+(A) = J~^(A) \ J+(A) and Corol- 
lary 8.3.1 imply that through every point x € E'^(A) \ A there is a 
generator of E~^(A) which intersects edge(A). Hence the set E~^(A) is 
also compact. 
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Properties (i) - (iii) of Proposition 9,3.1 would therefore have to hold 
under the conditions of Theorem 9.3.1 and the additional assumption 
that all causal geodesics are complete. I 

The idea for the proof of Proposition 9.3.1 is simple but the proof itself is 
quite involved. We will therefore first give an outline and then establish 
a sequence of lemmas which will imply the proposition. 

Suppose, (i), (ii), (iii) in Proposition 9.3.1 hold and assume without 
loss of generality that E'^{A) is compact. We will show that the hori- 
zon is non-compact or empty. Every non- vanishing vector 

field U must have a future inextensible integral curve 7 in D'^{E'^{A)). 
Otherwise we could map the compact set E'^{A) along the integral 
curve of U onto H'^{E'^{A)) which in turn would have to be compact 
(and non-empty), too. We apply a similar construction to the past of 
E'^{A)r\J~{'y) and obtain an inextensible causal curve which is wholly 
contained in D{E‘^{A)), This curve can then be used to construct an in- 
extensible maximal causal geodesic in contradiction to (i). 

In order to carry out this program we will prove the following facts. 

1. H+(B+(A))CF+(5J+(A)), 

2. The Cauchy horizon JT"^(jE+(A)) is non-compact or empty. 

3. There is a future in extensi ble timelike curve 7 C D~^{E'^{A)). 

4. Set E := E'^{A) n J”(7). Then there is a past inextensible curve 

\CD-{E^{E)), 

5. There is an inextensible causal geodesic without conjugate points in 

The last property 4 is in contradiction with (i) of Proposition 9.3.1. 
Lemma 9.3.1. Let A be a closed achronal set Then the inclusion 
if+(E+(A)) c if+(9J+(A)) 

holds. 

Proof, Let a: e if+(E+(A)) \ if+(SJ+(A)). Prom E+(A) C dJ^{A) we 
obtain I1+(E+(A)) C D^[dJ+{A)) and therefore a; € /-(D+(9J+(A))). 
Hence there is ay e fl D'^{{dJ'^{A))), 

We will first show that fl I~{y) does not intersect 9J+(A). 

Assume that there is a point 2: G dJ~^{A) O I*^(a:) O I~{y), Then the 
open set I~{z) is a neighbourhood of a: 6 H~^{E~^{A)) and intersects 
therefore D+(E’^(A)). Since every past inextensible timelike curve with 
future endpoint in D^{E'^{A)) intersects E"^{A) C dJ^{A) we would 
find a point z G I~ {z)ndJ'^ {A) C {dJ~^ {A))ndJ'^~ (A) in contradiction 
to the achronality of 9J"*‘(A). 

Since I~{y) is a neighbourhood of a: G H'^{E~^{A)) and J+(a:)n/"(T/) 
does not intersect dJ~^{A) there is a past inextensible timelike curve 
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7 which has future endpoint y and does not intersect E~^{A), Prom 
y e D'^ (A))) we see that 7 does intersect dJ‘^{A) at some point 2:. 
Let fx be the generator of dJ~^{A) with future endpoint 2. By Corollary 
8.3.1 this generator is either past inextensible or intersects edge(A). We 
will show that both cases lead to a contradiction. 

Assume first that there is a point z e edge(A) which is intersected 
by fx. This point also lies in A since A is closed. Hence jx is contained in 
J’^(A) which in turn implies ^ € J'^{A) D dJ~^(A) = E~^(A). This is a 
contradiction to the construction of 7. 

Assume that ix is past inextensible and does not intersect A, Since 
7 is timelike and has future endpoint in D'^{dJ^{A)) it intersects the set 
int(D"^(9J*^ (A))). This implies that fx intersects I” (dJ'^(A)) (cf. Lemma 
8.3.6). The inclusion fx C dJ~^{A) gives a contradiction to the achronality 
ofaj+(A). I 



Lemma 9.3.2, Let A be a dosed achronal set such that J+ {A) is strongly 
causal. Then H~^{E'^{A)) is non-compact or empty. 

Proof, Suppose that is non-empty but compact. Since J^{A) 

is strongly causal, H'^{E'^ (A)) can be covered by a finite number of 
convex neighbourhoods Ui with compact closure such that no Ui is in- 
tersected twice by any causal curve. Let zi e H^{E'^{A)) and 
be one of the convex neighbourhoods Ui with Zi e W^(i). Because of 
Lemma 9.3.1 there is a point xi € J~^{A) n \ D-^{dJ{A))). By 
Lemma 8.3.8, there is a timelike pa st inextensible curve ai through xi 
whi ch does not in tersect D~^(dJ{A)). Hence ai neither intersects dJ^{A) 
nor £)+(£+ (A)). Since ai does not intersect dJ'^(A) it is contained in 
int(J+(A)) = I~^{A), The curve ai leaves Ui^i) because of its compact- 
ness. There is a point yi € ai \Z4(i) C I'^{A), Let /?i be a past directed 
timelike curve from yi to A, Since A C E'^(A) and £'+(A) is an achronal 
(topological) hypersurface this curve must intersect D‘^(E"^(A)) and 
therefore also H~^{E~^{A)), Let Z 2 € /3injff+(£;+(A)) and let W^(2) be one 
of the convex neighbourhoods Ui with 22 e Ui( 2 ), The neighbourhoods 
Ui(^i^ and Ui( 2 ) are different since by construction we have Z 2 € 
and since no Ui can be entered by any causal curve twice. By induc- 
tion we obtain an infinite sequence of pairwise disjunct neighbourhoods 
{^i{k)}k€N 131 contradiction the finite number of sets Ui, I 



Lemma 9.3.3. Let A be a dosed achronal set such that J~^{A) is strongly 
causal and assume that E'^{A) is compact. Then there exists a future in- 
extensible timelike curve 7 which is wholly contained in D'^{E'^{A)). 

Proof. Without loss of generality we can assume that {M^g) is time 
oriented. Hence there exists a timelike, time oriented vector field V on 
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M. Since E~^[A) is an achronal hypersurface all future directed timelike 
curves with past endpoint in E'^{A) are initially in int(D"*'(£''^(i4))), If 
every integral curve of V intersected {E‘^ {A)) after having intersected 
E~^{A)^ we would obtain a continuous map E~^{A) — > H'^{E'^{A))^ x 
Pt{x){^) where F is the flow of V and t(x) > 0 the unique number 
with G II'^(E~^(A)), This map would be surjective because, 

by Lemma 8.3.8, every past inextensible timelike curve which intersects 
the event horizon of a closed set must intersect this set as well. Since 
E~^(A) is compact, H'^(E'^(A)) would also be compact in contradiction 
to Lemma 9.3,2. Hence there is at least one future inextensible integral 
curve 7 of which is contained in int(D~^ (E‘^ (A))) . I 

Lemma 9.3.4* Let (M^g) be a causal geodesically complete and strongly 
causal spacetime in which every inextensible causal geodesic has a pair 
of conjugate points. Let A be a closed achronal set with compact fu- 
ture horismos E'^{A) and let 7 be a future inextensible timelike curve in 
D-^{E-^(A)). 

Then there exists a past inextensible ti melike c urve A which is con- 
tained in D"'{E~{F))) where F ~ E'^{A) fi ^“(7). 

Proof We will first show the inclusion E~{F) C FU dJ~('y). 

Let x € E~'{F) \ F. If there was a point x G H E^{A) then 

I^{x) would be a neighbourhood of x and therefore intersect I'~ (E'^(A)) 
in contradiction to the achronality of E^{A). Hence I~{x)f]E'^{A) = 0. 
If a; € /“(7) then there is b, z G I'^ix) O I~(7). Denote by p a. timelike 
curve from x through 0 to 7. This curve must intersect E~^{A) since 
7 C D'^{E'^{A)) and I~{x) fi E~^{A) ~ 0. Since this intersection point 
is in E~^{A) fl /“(7) C F we obtain x G I"{F) in contradictio n to th e 
assumption x G E'{F). Hence we have x G J~{F) \ /”(7) C J~(7) \ 
J“ ('y) — 0 J- (7) and the assertion E~ {F) C Fl) dJ~ (7) follows. 

The set F is the intersection of a closed and a compact set and there- 
fore compact. Since 7 is future inextensible, all generators of 0J~(7) 
must be future inextensible as well. Suppose, there was a sequence fii of 
generators of E~~{F) with diverging affine lengths. Since F is compact, 
there would exist a cluster curve (3 of {pi}ieN which would be past in- 
extensible. But then its geodesic prolongation would be an inextensible 
generator of By assumption this generator cannot be achronal 

which gives a contradiction to the to the achronality of 9J~(7). Hence 
E~(F) is compact and we can apply the time reverse of Lemma 9.3.3. 



Lemma 9.3.5. Let C be a compact subset of M . If D^ (C) contains a 
future inextensible timelike curve 7 and D~{C) fl J~{'y) contains a past 
inextensible timelike curve A, then D{C) contains an inextensible causal 
geodesic without conjugate points 




9,3 The singularity theorem of Hawking and Penrose 395 



Proof. Let {yi}ien be a sequence of points in 7 without accumulation 
point such that yij\.i £ Choose a sequence in A such 

that yi 6 I~^{xi) and Xi € for all i. For every i we obtain 

a causal curve jli which joins Xi via C to yi. This causal curve is con- 
tained in a globally hyperbolic set (Theorem 8.3.1) and can therefore be 
replaced by a maximal geodesic segment fii (Proposition 8.2.2). With- 
out loss of generality we have € C. Then the oriented half lines 
{(E+ \ {0}) • i e N} have an accumulation point i in the space of 

causal directions over C, because this space is compact. Any inextensible 
geodesic with fi{0) 6 ^ is a cluster curve of the sequence Since 

any cluster curve of maximal geodesics is maximal, the curve fx does not 
have a pair of conjugate points. I 

Proof of Proposition 9.3, L We only need to choose C = E~' [T). I 



9*3.1 Applications of the singularity theorem 

(i) Consider the Schwarzschild solution. It satisfies all assumptions of 
Theorem 9.3.1 with the exception of the genericity condition. How- 
ever, it seems plausible that any generic perturbation of the Schwarz- 
schild solution using a reasonable matter model should result in a 
spacetime which satisfies all the assumptions. Here it s important 
that the existence of a strictly closed trapped surface is an open con- 
dition, i.e., if a spacetime which contains a strictly closed trapped 
surface is slightly perturbed then this surface is also a strictly closed 
trapped surface in the perturbed spacetime. Hence Corollary 9.1.1 
indicates that the Schwarzschild singularity is stable under (physi- 
cally reasonable) perturbations. In particular, it is not an artifact of 
the high symmetry of the Schwarzschild spacetime. This application 
of Corollary 9.1.1 is one of the main reasons why the existence of 
black holes is widely accepted. 

(ii) Consider a Robertson Walker solution without cosmological con- 
stant and spacelike hypersurfaces of constant, positive sectional cur- 
vature. These spacelike hypersurfaces t = const are compact achro- 
nal sets without edge. The Ricci tensor, which is given by 

Ric = T- = I (3(6 +p) + {€~p))U^ ®U'’ + {e-p)g) , 

is positive definite for e > p > 0. Hence in this case the timelike con- 
vergence and the genericity conditions are both satisfied. It follows 
that all assumption of Theorem 9.3.1 hold even if the spacetime is 
slightly perturbed. This indicates that (at least for closed universes) 
the big bang is not an artifact of the symmetry properties of the 
Robertson Walker cosmologies. 
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(iii) Consider again a Robertson Walker solution (M,g) of arbitrary 
constant sectional curvature and assume that c > > 0. Then the 

genericity and the timelike convergence conditions are satisfied. We 
will now show that (M, g) contains a strictly closed trapped surface. 
By Corollary 6.1.2 there are coordinates and a positive 

function t a{t) such that 

g = ~dt^ -I- a^{t) — ^^dr^ -f (d9^ + sin^(^)d<^^)^ . 

Consider the codim~2 surface — {x : t{x) = i,r(x) = f}. This 
surface is clearly compact and spacelike. The vector fields 





\/l — 
a 




along ^ are normalised null vector fields orthogonal to and 
the corresponding 1-forms are given by 



Hence 

e± - t,r„{VNi) = ^ (vs,Nlo.) + 

where for the third equation we have used spherical symmetry. Since 
for i 6 {iC,r} we have = lg^^(2degje - digee) = -\g‘^'’di{oP'r^) 
we obtain FIq = -dta/a and Fgg = -1/r. This implies 



^ i^UN±)t + r^e{N±)r) 



2 f dta ^ a ^ 
\ 2u ^ r\/l - er^ ) 



Hence for a < 1, > 0 both expansions, 9± are negative. Since 

these inequalities are satisfied near the big bang we can apply the 
time reverse of Theorem 9.3.1 to infer the existence of a singular- 
ity. In fact, in our example this singularity is just the big bang.'^ 
If we perturb our spacetime slightly all assumptions of Theorem 
9.3.1 are still satisfied. Hence we can conclude that the perturbed 
spacetime also contains an incomplete, inextensible causal geodesic. 
It is therefore natural to expect that the big bang is stable under 
perturbations of the metric. 

^ Note, however, that Theorem 9.3.1 does not make any assertion about the 
location of the singularity. In particular, it does not assert whether it is to 
the future or to the past of the strictly closed trapped surface. 
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(iv) Using condition (c) of Theorem 9.3.1 it is also possible to give argu- 
ments in favour of the existence of a singularity without assuming 
that our universe is well described by a Robertson Walker space- 
time. This argument requires assumptions on the spectrum of the 
microwave background radiation and is therefore beyond the scope 
of this book (cf. (Hawking and Ellis 1973, pp. 354 - 359)). 



9.3.2 General problems with Theorem 9.3.1 

These physical applications of Theorem 9,3.1 suffer from two defects 
which are often considered to be negligible. Firstly, instead of the for- 
mation of singularities spacetime could form closed timelike curves. Sec- 
ondly, even if singularities occur, Theorem 9.3.1 does not predict their 
strength. For a long time, it has been thought that these problems are 
only technical and that the theorem could be sharpened accordingly. 
Unfortunately, this is not the case. In Sect. 9.4 we will present an ex- 
ample (due to Newman) that the chronology condition is necessary for 
Theorem 9.3.1. In Sect. 9.5 we will show that Theorem 9.3.1 may only 
predict the existence of singularities which are too weak to be taken se- 
riously by most physicists. At the time of writing it is not clear whether 
it is possible to improve on Theorem 9.3.1 if additional physically realis- 
tic assumptions are made. It should be remarked that other singularity 
theorems suffer similar defects. 



9.4 Singularities and causality violations 

The chronology assumption in Theorem 9.3.1 is a global assumption and 
therefore can not be verified by physical measurements. While it is often 
considered to be self-evident we have seen in Chap. 8 that this view is 
to be debated. It is therefore an important question whether Theorem 
9.3.1 continues to hold even if the chronology condition is dropped. In 
this section we will give an example due to R. P. A, C. Newman (1989) 
which proves that the chronology assumption is essential. We will then 
quote a generalisation of Theorem 9.3.1 which sheds some light on what 
is going on. 



9.4.1 The Godel solution 

The proofs of Propositions 94J and 9.4,2 consist of straightforward 
but long calculations. We will not spell them out in all details. How- 
ever j we will provide enough information such that a careful reader 
equipped with pen & paper (better: with access to a symbolic computing 
program such as REDUCE, MAPLE or MATHEMATICA) should be 
able to fill in the missing details. 
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In 1949, the famous mathematician Kurt Godel published a new solution 
to Einstein’s equation for a dust matter model with a cosmological con- 
stant. His solution is completely homogeneous and has the property that 
there is a closed timelike curve through each point, Newman’s counter 
example is a modification of the Godel solution. 

Definition 9.4.1. Let (t^x^y^z) be standard coordinates ofM.^ andcv G 
R+\{0}. The spacetime -dr+dx^+le^^‘^^dy‘^+dz^-2e'^'^^dtdy) 
is called the Godel solution. 

Proposition 9.4.1. The Godel solution is a Lorentzian spacetime and 
satisfies 

Ric - ^Scal^ - Lo^g = Stt {dtf . 

It corresponds therefore to a dust solution with negative cosmological 
constant. 

Proof For the first claim observe that (<9^)*’ = ~{dt + e'^^^dy) implies 
g = - {dtf ® {dtf + dx^ + + dz\ 

For the second claim we will have to calculate the Ricci tensor. Since the 
metric depends on only one variable this is a simple task and left to the 
reader. Here we only note that the values of the Christoff el symbols are 

r;, = r‘„ = -yi^,iZ = n, = = rj = 

n, - n, = 

where it is understood that all other Christoffel symbols vanish. The 
Ricci tensor reads then 

Ric = 2^2 + e^^^dy^ + 2e^^dtdy) = 2u? (pt)^ ® (pt)^ 

and we obtain 

Ric —g — oP' ^2 {pif ® {pif -1- yj . 

I 

The following lemma implies that the Godel solution is homogeneous in 
space and time. 

Lemma 9.4.1. For any two points p,q e M there is an isometry 
p: M M which satisfies 'ip{p) ~ q. 
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Proof, The spacetime M admits the following four 1-parameter groups 
of isometries^. 

V-i (a) : {t, x,y,z)\-^ {t + a, x, y, z) 

: {t, X, y, z) ^ («, x + a, z) 

tps{a) : {t, X, y, z) (t, x,y + oc, z) 

iPa{o!) : {t, X, y, z) {t, x,y,z + a). 

It follows that for any p^q e M there are numbers o:i, . . . , a 4 such that 
P = '^ 4 ( 0 ^ 4 ) o ^s(c^3) o '02(c^2) o I 



The following lemma implies that the Godel spacetime does not contain 
singularities. 

Lemma 9.4.2. Each inextensible causal geodesic in [M^g) is complete. 

Proof, We will first partially solve the system of equations for geodesics. 
To this end it is practical to work with slightly different coordinates. 
Consider the global coordinate transformation : M RxR’^\{0}xM^, 
{t,x,y,z) 1 -^ {i,x,y,z) where t = i, = -~ln{^/2uJx), y = \/2y, and 

z = z. In these coordinates the metric reads 

g = -(dt-h ■^dy)'^ 4* +dy^) +dz^, 

UJX u^x^ 

We have the 1-parameter families of isometries 

{a ) : (t , X, y, z) {i a, x, y, z) 

'il)2 : (i, X, y, z) ^ (i, (1 -4 a)x, (1 + a)y, z) 

(a) : (t, x,y,z)^ (t , x,y-\-a, z) 

{i,x,y,z) ^ {i,x,y,z a). 

The corresponding Killing vector fields are 

^2 — ^ 9 x + ydy ^ ^3 = ^4 = ^2. 

These Killing vector fields give four constants of motion along the geode- 
sic, (^i, 7 ) = a is constant. Writing 7 (r) = {t{r),x{r),y{T)jZ(T)) we 
obtain 



Cl =~t- 



A, 

UJX^ 



C2 = - 



2u^x 



C 3 = 



yx “ 2ufxi — y 



C3 = z. 



This is a linear system of equations for S), and its solution is 

given by 

^ The isometry group is five-dimensional, but the additional 1-parameter 
group of isometries is not important for the argument. 
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i~ci-2u;xcs, x = 2 u)^x{c 2 - ycs), y = 2uj‘^x(-~— + xcs), 5 = C 4 . 

UJ 

If we rename (ci, C2, C3, C4) obtain 



_ x' ~ 2x ^ _ \/2u)x{y — y') 
^ _ ^/2lox{x — x^) _ d 



(9.4.3) 



We can assume without loss of generality that (7,7) = rf € {—1,0,1}. 
Inserting our values for ?, x, y, i into this equation we obtain 

(y - y^f + {x~ ^ + TjC^. 

Hence the projection of 7 to the {x, y) -plane traverses an arc of a cir- 
cle with centre (x\y^) and radius ^ + 7]C‘^ . Observe that for 
causal geodesics we have the inequality 

which iniplies that the circle is wholly contained in a compact subset of 
R'^ \ {0} X R. In particular, the coordinates x, y in remain bounded along 
7. Equations 9.4.1 imply that 7 is bounded in our coordinate system. 
Since these coordinates are global, it follows that the affine parameter 
must range from —00 to 00. I 



The Godel solution is axially symmetric with respect to any point. To 
see this, we will introduce different coordinates. 

Proposition 9.4.2. There is a dense open set N C M and coordinates 
(s,r, z) 6 M X \ {0} x 5^ x R such that is given by 

g = ^ — ds^ + dr^ 4- sinh^ r(l — sinh^ r)d(p^ 

+ di^ “ 2\/2 sinh^ rd</?ds^ . 

Proof, Observe that the metric is a direct product, (M^g) = (R^ x 
R, + d^^) with h — ~dt^ + dx‘^ + — 2e^'^“'dtdy. Hence it is 

sufficient to show that there is a dense open set N CR^ and coordinates 
(s, r, V?) such that 

hjjy = 2u)^ + dr^ + sinh^ r(l - sinh^ r)d<^^ - 2\/2 sinh^ rd<^d5^ , 
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The assertion follows then with z — We define the coordinates 

(s,r, (/?) via the equations 



g-v/ 2 wn: „ cosh(2r) + cos(v?) sinh(2r) (9.4.4) 

= sin((/?)sinh(2r) (9.4.5) 

i((^ + “ \/2s)^ = e“^^ tan • (9.4.6) 



To show that h has the desired form in these coordinates is “straightfor- 
ward” but very cumbersome.® 

We first differentiate Equations (9.4.4)-(9.4.6) to obtain 



0 = - 2(sinh(2r) + cos{ip) cosh(2r))dr 

+ sin{(p) sinh(r)d 9 P 

0 = y/2uj'^e^^^ydx + e^^^^wdy - 2 sin(<^) cosh(2r)dr 

- cos((^) sinh(2r)d(/? 

0 = ^ (^1 + ^tan(t/?/2 + l/2(jjt ~ l/2\/2s)j ^ codt 

- i + ^tan( 9?/2 + cot/2 — s/V2)^ ^ V2ds -I- 20 ”^^^ tan((^/2)dr 

2 ^ ~ ^ + (tan(v?/2))^j + 1 

+ ^tan(<^/2 d-ujt/2- s/V2)^ \ d(p. 



In this system of equations we can eliminate y and using Equations 
(9.4.4) and (9.4.5). The system can then be considered as a linear system 
for dt, dx, dy which only depends on s, r, cp, d$, dr, dip. Solving this linear 
system gives (after some simplifications) 

_ y/2ds ^ sm{ip/2)cos{(p/2)e^'^dr 

CO (jj (e^’^ cos‘^{(p/2) + sin^(y?/2)) 

^ (- sin^(<^/2) 4- e^’^ - cos^{(p/2)) dip 

CO ^e^’^cos2((^/2) + (sin((/)/2))^j 

dx — 1/2^ (2e^^cos^((/?/2) - 2s\T?{ip/2)) dr 
CO (e^’'cos^(</?/2) -f sin^((^/2)) 

® In fact, in his original paper, Godel chose to derive this form independently 
of the geometric assumptions which led him to the metric. 
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I ^ /o V^sin(l/2y)) cos((/?/2) (l - e^’’) dy 
to (e'*’’ cos^(v?/2) + sm^{(p/2)) ’ 

dy — i sin(tp/2)cos{(p/2)e‘^’’dr 

u) (e^’’ cos^(^/2) + svc?{tp/2)'f' 

I cos^(y/2) - dtp 

to (e^’’cos^(v?/2) + sm^(v?/2))^ 

We can now simply calculate h= -dt‘^ + dx^ + - 2e'^^®dMj/ 

in the coordinates (s, r, (^) using our expressions for dt, do:, dy and Equa^ 
tion (9.4.4) (which is equivalent to 

^V2ujx „ ^-2r cos^((/?/2) + (sin((^/2))^^ ). 

This gives with 



A{r, if) = sin^((/?/2) -f cos^(y?/2) - e^’’ 
B(r, (/?) = - sin^((^/2) + - e'^^ cos^(</?/2) 



and using trigonometric identities 



/ V^e-^M(r,y)+2jg(r,y)x/2 \ 

^ (e^’^cos^((/?/2) +sin^(9?/2))y 

^ sin^((/?/2) cos^(v:?/2) 

\ o;2 (e^’’ cos2((/p/2) + sin^((/:)/2))^ 

+ o (sin^(V^/2) ~e^^cos^((/:)/2))^ \ ^^2 

(e^^ cos^((^/2) + sin^((^/2))^y 

\ a?^^e^’'cos 2(y>/2) + (sin(^/2))^) 



d<^ds 



- I ^ y))^ - 2 cos^('^/2) sin^(y/2) (l - 

(e^’’cos2(yi/2) + sin^((^/2))^ 

= 2o;“^( - ds^ + dr^ + sinh^ r(l - sinh^ r)d(^^ H- d^^ 
— 2\/2 sinh^ rd(^ds) , 



dip^ 



We will now show that there are spacelike submanifolds given by r = 
const, s = const which have vanishing expansions. They are our candi- 
dates for strictly closed trapped surfaces in a suitably perturbed metric. 
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Proposition 9.4.3. Let s £ % r e \ {0}, and he the codim- 
2 surface given by s = s,r = f. Then is spacelike if and only if 
f < ln(l + \/2). In this case its mean curvature vector field reads 



H = 



1 -2sinh^(r) 
2\/2 sinh(r’) cosh(r) 



(N+-N-), 



where 



du> 



Jl -sinh^(r) 1 1 

iV± = V _ ^;~ ds ± -^dr + , - 

v2cosh(r) v2 cosh(r) y 1 — sinh^(r) 

are a pair of normalised null vector fields orthogonal to 

Proof. Since g = h + is a direct product it is sufficient to prove 
the analogous assertion for h. For any x £ the tangent space of 
is spanned by d^. It is clear that h{d^,d^) > 0 if and only if r < 
sinh“^(l) = ln(l + V^). We first calculate N±. From 

0 = h(N±,d^) = sinh2(r)(l - sinh2(r))iV| - V2smh.Hr)N^ 
we get equation 

0^g{N±,N±) 

= -(JV|)2 + + sinh^(r) ((1 - sinh^(r)))JV^)^ 

— 2 sinh^ (r) 

1 + sinh^(r) 



= mf- 



1 - sinh^(r) 






implies therefore 






ds ± 



cosh(r) 



-dr 



^/2 



j -= zur -r 2/ \ 

sJl-smL\r) 1 - smh (r) 



N: 



±- 



We normalise N+,N- by demanding {N+,N-) = -1. This is equivalent 
to 

-1 = ( - 1 - + sinh^(r) (l - sinh^(r)) ArfTT 

^ l-sinh^(r) l-smh^(r) 



- 2\/2sinh^(r) 

2cosh^(r) 

1 - sinh^(r) 



\r) 

V 2 

1 -sinh^(r) 



){Nif 
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whence 



N± = ^{ {-N± ~ V2sinh^{r)Nfj ds + (JV^) dr 

(-V2sinh2(r)7Vi + smh2(r)(l - sinh2(r))iV|) d^) 



+ 






+ 






/l - sinh^(r) V2sinh2(r) 



V2cosh(r) cosh^yr — sinh^(r)^ 



d5 ± — =dr 
\/2 



+ "l"h=(r)(l -3lnh"(r)) ' 
v/2cosh(r) cosh(r)y'l-slnh’(r), 

cosh(r) 



d<^ 



d5 ± —=dr 



\/2y^l - sinh^(r) ^ 



We can now calculate the covariant derivatives VN±. Since we are only- 
interested in trr,,Ax±) = and is spanned by we 

need only calculate the covariant derivative in direction 0^, 

trT.,.(Vivi) = . 

In order to calculate the Christoffel symbols note first that the 

inverse of h is given by 



h’^ — — 



1 “ sinh^(r) 
cosh^(r) 



(^5)^ + (9r)^ + — 



1 






2V2 



sinh^(r) cosh^(r) ^ ^ cosh^(r) 






It is now a simple exercise to compute 



r; 



(pip — 2^ (^dph‘pp d" dphp-p Ofhpp'j 



- l-h^'^drhpp = ~9r(sinh^(r)(l - sinh^(r))) 
= - sinh(r) cosh(r)(l ~ 2sinh^(r)), 



^ip(p 2^ ipif^ips H“ ^cph$p ^s^tpcp) “b 2 diphpp — Ophpp) 
= 0 . 

Hence we get 



trT.,.(ViVi) = - 
- ± 



sinh^(r) cosh^(r) 

1 — 2sinh^(r) 
\/2sinh(r) cosh(r) 



* sinh(r) cosh(r)(l - 2 sinh^(r)) 

_ 
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and the assertion follows now directly from Equation (9.2.2), I 

Proposition 9.4.3 implies that the surfaces ^jn((i+V 3 )/V 2 ) spacelike 
and have vanishing mean curvature vector field H. 



9.4,2 Newman’s example 

Consider the following partial compactification of the Godel solution. 
{M,g) — (R^ X + dz^) where 2 ; is the natural coordinate of 5^. 
(Here we view as the subset [0,27r] c R, where the points 0 and 
27 t are identified). Clearly, (M,g) is locally isometric to {M^g). Observe 
that the corresponding sets are compact (and diffeomorphic to tori). 
In particular, the surfaces ^^in((i+V 3 )/v^) closed (but not strictly 
closed) trapped surfaces. It is plausible that a suitable deformation of 
(M, g) will result in compact surfaces with past pointing, timelike mean 
curvature vector field. This would give an example which is causally 
geodesically complete and satisfies all assumptions of Theorem 9,3.1 — 
with the exception of chronology. In order to preserve the causal struc- 
ture of the Godel spacetime we will deform g by multiplying it with a 
conformal factor C : M ^ R. We need to calculate the change of the 
mean curvature vector field when g is replaced by ^ = O^^g. 

Lemma 9.4.3. Let (M^g) a pseudo- Riemannian manifold andO: M 
R. Let g = Q^^g and V (resp,, V) the Levi-Civita connection of g (resp.^ 
g). Then for every vector field U and every 1-form X on M we have 

VfjA = VijX ~ (dI2([/)A + \{U)da - A(gradI2)^(?7, •)) . 

Proof Denote the difference tensor of V and V by C, G(f7, V) ~ VjjV — 
Vjr/F. Let (rr^, . . . , x'^) be a normal coordinate system with respect to g 
centred at rr € M. Then we have at — 0 at a: and therefore 

(C(a,,,a.,))‘ = (Va,,a„)‘ - = (%;?)* = /?■ 

= {d^‘Sl3 + 

at X, Since this is a tensor equation we can infer 

everywhere. From = Q'^gij and ~ Q~‘^g^^ we get 

^i9jk ~ ^^^^i^')gjk 
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and therefore 



ct = + aa^j^ - aa^in) . 

The assertion follows now directly from VA = VA - A(C(-, •)), I 

Lemma 9.4.4. LetT§^f be the projection ofT^^f. to M andU^V be open 
neighbourhoods of 7}^^ which have compact closure of 7§^f^ and satisfy 
U CV, Then there is a function Let f : M R"'* such that 

(i) f{x) € (-i, |), 

(a) supp/ C V, 

f\u only depends on t, and 
(iv) dsf{x) < 0 for all x e 

Proof This is a simple application of Lemma 2.1.7. I 



Proposition 9.4.4. Let f ^ ln((l + V^)/s/2), tq € (0,1); / be the 
function provided by Lemma 944 o^'o.d set Or(x) = 1 + rf{x) for all x, 
T € [0, To). Then the family pr := {L2tY 9 is a deformation of g which 
depends smoothly on r and satisfies go = 9^ Furthermore, there is a 
Ti € (0,To) such that each g^ (r € (0,ri)y) contains a strictly closed 
trapped surface. 



Proof The first assertion is trivial and we only have to show the existence 
of a strictly closed trapped surface for r > 0 sufficiently small. Consider 
the surface We must show that tr.j„ are both negative. Here 

N± = denotes a pair of normalised null vector fields orthogonal 

to T§^f. We can restrict attention to U and obtain 



+r^V5^(iV±)'(a,) 

= L^'O 



=0 



g^\VQj,N^)\d,)-Q-\~AQ{N^))ge^4 

ih 

cosh^(r) 



+ 






V2 sinh(r) cosh(r) 
+ Q-'^dn{N±) 
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= n-‘(± 

\ v2sinh(r) cosh(r) cosh (r) V2cosh(r) / 

Inserting r —f this gives 



0 ± = n 



-2 



2 1^1 -sinh^(r) 

cosh^(r) ^ ^ v^cosh(r) 



It is clear that for r small enough the genericity and timelike convergence 
conditions are still satisfied. Hence for r small enough the spacetime 
is complete but satisfies all assumption of Theorem 9.3.1 with 
the exception of the chronology condition. 

It is possible estimate which kinds of causality violation can invalidate 
Theorem 9.3.1. In the proof of Theorem 9.3.1 the future horismos E'^{T) 
of the strictly closed trapped surface T played a significant role. In a 
spacetime with chronology violations (and in particular in our example) 
this set is in general empty. However, there is a generalisation of the 
non-global features of a horismos. 

Definition 9.4*2. Let (M, g) be a time oriented spacetime and T> be a 
compact set which is achronal in some neighbourhood ofU ofV, 

(i) Let'y: [0,6) be a future directed future inextensible null geodesic 
starting in V, A point x = ^{t) is called a focal point; if 

(a) for all 7 (^- 1 -) (t^ > t) there is a timelike curve from V to 7 (t+ 
arbitrarily close to 7 ; 

(b) There is not any t^ <t such that T> and 7 (t_) can be connected 
by timelike curves arbitrarily close to 7 . 

(ii) Denote by^^’ ihe maximal geodesic prolongation of the 

generator of E~^{V,U) with starting point x = jx{^) which does 
not have a focal point The generalised future horismos of V is the 
closure e'^('D) of the set {jxit) € M : x 6 P}. 

(Hi) The generalised future focal set of V is defined by 

f^{V) = {y e e'^(P) : y is future endpoint 

of some generator ^x ofe'^{T))}, 

e~ (P) and f~ (P) are defined analogously. 

It is clear that for every spacetime and every compact set P we have 
£’"^(P) C e+(P) and P C e"*'(P). The future horismos is always a 
Lipschitz hypersurface with induced degenerate metric. A similar prop- 
erty is also true for e'^(P). 
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Lemma 9*4,5, Let fi C e^(V) be a causal curve. Then fj, is either 

— a subset of a null geodesic generator of e'^{V) or 
~ a causal curve contained in f'^i'D) or 

— a concatenation a subset of a null geodesic generator and of a causal 
curve contained in f~^{V) . 

Proof. If /i is a non-trivial curve in e+(P) which is not a part of a 
generator of e‘^{V) and does not lie in then it is intersected 

transversely by some generator of e“^(P). Hence there exists a broken, 
causal curve A in e^{V) \ f'^(V) with past endpoint in V. Let y € A be 
a point s after this break and jz the generator of e+(P) with future 
endpoint y. Using our broken causal curve we see that there is a timelike 
curve from V to y arbitrarily close to 7^ . This gives a contradiction to 
the definition of e+(P), I 

Hence any closed causal curve arbitrarily close to the generalised future 
horismos must lie in f'^{'D). This is exactly what happens in the example 
of Newman. The null geodesic generators of end in a caustic 

/■^(^,r) which is ruled by closed null curves of the form s = const ^ 
r = const, 2: = const. Observe that these curves are not null geodesics. 
The existence of these curves is basically the reason why Theorem 9.3.1 
fails in the presence of causality violation. Notice that we can slightly 
generalise our example such that f~^{T§^i^) is not ruled by closed causal 
curve but only by almost closed causal curves. We simply replace the 
identification {s^r^(p + 2'k.^z) = {s^Vy(pjZ) by an identification {s^r^ip + 
27T, = {Syr,(p,z + a) such that the quotient a/27r is irrational. It is 
clear that the curves 7 locally defined by 5 = const, r = const, ^ — const 
are not closed but satisfy instead: 

For each j{t) and each (small enough) neighbourhood V of j{t) 
there is a > t such that the segment of 7 between 7(t) and 
7(t+) leaves V and then re-enters this set. 

All other properties of our example are unchanged since the new space- 
time is locally isometric to the old one. In order to state a theorem which 
justifies the claim that the only impediment to a version of Theorem 9.3.1 
in the presence of causality violation is the possible existence of almost 
closed causal curves in f'^ (T) we need the following technical definition. 

Definition 9.4.3. Let j be a curve and choose any Riemannian met- 
ric h on M. Let (jl: (a, 6) be a reparameterisation of 'y which satisfies 
h(fi^fi) = 1. We call 7 almost closed if there exists a vector u € 
{jji{t) : t £ (a, 6)} such that for every neighbourhood iX ofu iuTM there 
exists a deformation A of // in tttm (A1) which yields a closed curve and 
satisfies \{t) G A(^) ^ Af- 

Observe that this definition is independent of the choice of h. 
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Theorem 9.4.1* A spacetime {M^g) is not causal geodesically complete 

if 

(i) the timelike convergence condition and the genericity condition 
holdf 

(ii) there exists at least one of the following: 

(a) a (locally spacelike) strictly closed trapped surface T, 

(b) a compact achronal set T without edge, 

(c) a point x such that along every past (or every future) inexten- 
sible null geodesic from x the expansion of the null geodesics 
starting at x becomes negative, 

(Hi) neither f^iT) (respectively, f‘^{{x}) nor any f~{V), where V 
is a compact topological submanifold (possibly with boundary) with 
VoT ^ 0 (respectively, x ^V) contains any almost closed causal 
curve that is a cluster curve of a sequence of closed timelike curves. 

This is a proper generalisation of Theorem 9.3.1. The technical condi- 
tion (iii) just states the situation which we have already anticipated by 
analyzing Newman’s example. The proof of Theorem 9.4.1 is far too 
technical to be reproduced here. It basically consists of a cutting and 
pasting procedure (Kriele 1990). 

The closed trapped surface in Newman’s counter example has the topol- 
ogy of a torus. In a physically realistic collapse scenario of a star one 
would rather expect that there exists a closed trapped surface of topol- 
ogy 5^ surrounding the collapsing star. This motivates the following 
conjecture: 

Conjecture Q.f.l. A 4-dimensionaI spacetime {M, g) is not causal geodesi- 
cally complete if 

(i) the timelike convergence condition and the genericity condition 
hold, 

(ii) there exists a strictly closed trapped surface of topology 5^. 

In Newman’s example the generalised future focal set of 7^ in((i+ v^)/ 2 ) 
is generated by closed null curves. This is impossible if T has topology 

52, 

In spite of this small piece of evidence and the importance of Conjec- 
ture 9.4.1 for our interpretation of singularity theorems it is completely 
open whether this conjecture is true or not. 



9.5 Strength of singularities and cosmic censorship 

In this section we will investigate the character of the singularities pre- 
dicted by Theorem 9,3.1. We will also give an example (cf. Sect. 9.5.1) 
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which shows that the theorem of Hawking and Penrose may only im- 
ply the existence of singularities^^ which are so weak that the energy 
density exists in a disti ibutional sense. Our example is not very phys- 
ical — for a start, it is 3-dimensional rather than ^-dimensional. On 
the other hand, it is a good test case for the mechanism behind the 
singularity theorems. 

The existence of incomplete causal geodesic does not imply that there 
is a singularity. This is the reason why “singularity theorems” are often 
referred to as “incompleteness theorems” . The standard counter example 
in general relativity is the Taub-NUT spacetime (cf. (Hawking and Ellis 
1973, chapter 5.8). The following two-dimensional example is especially 
simple. 

Example 9,5.1 ( Clifford- Pohl torus). Let (M,g) = (K^\{0}, p~^dudu). 
Then the curve 7(t) (3-^,0) is an incomplete geodesic and the map 

^ : (w, v) (2u, 2v) is an isometry. Defining 

X r^y 3k eZ with 'ip^(x) = n: x [x], 

we obtain a compact Lorentzian torus n^g). The curve 7 t( 7) is an 

incomplete lightlike geodesic in this compact and therefore non-singular 
spacetime. 

While Example 9.5.1 shows that there exist spacetimes which are non- 
singular and geodesically incomplete, so far there is no example of such 
a spacetime which satisfies the assumptions of Theorem 9.3.1. In fact, 
it is believed that the incomplete geodesics predicted by the singularity 
theorems are of a different nature. The justification for this expectation 
is the fact that a non-compact or empty “Cauchy horizon” plays an 
important role in the proof of the theorem of Hawking and Penrose. 
Since the Cauchy horizon is necessarily closed, one would expect that a 
Cauchy horizon due to compactly imprisoned curves (such as in Example 
9.5.1 above) is compact and non-empty This motivates the following 
conjecture. 

Conjecture 9.5.1. If the assumptions of Theorem 9.3.1 are satisfied, then 
there exists a causal inextensible incomplete geodesic which leaves every 
compact subset in both future and past direction. 

Using an approximation argument one can show that the singularity pre- 
dicted by the theorem of Hawking & Penrose is not just due to g merely 
being (7^" instead of C^. There also exist upper bounds on the diver- 
gence of the curvature along any incomplete geodesic 7 which are large 
enough to allow for the possibility of strong curvature singularities,i.e, 
curvature scalars which not only diverge but whose integral over an ap- 
propriate spacetime region diverges as well. Unfortunately, we have no 
curvature estimates for anything in between these two extremes. 
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If, in general, the singularity predicted by Theorem 9.3.1 were only 
weak, this theorem would only predict the existence of “shockwaves” and 
we would lack evidence for the existence of serious singularities such as 
black holes. The following conjecture (formulated by Hawking & Ellis 
(Hawking and Ellis 1973) with respect to a slightly different singularity 
theorem) is therefore central to our interpretation. 

Conjecture 9,5,2, Assume that (M, g) is chronological, satisfies the time- 
like convergence and the genericity condition and contains a closed trap- 
ped surface. Then there exists an incomplete, future inextensible geodesic 
7 and a neighbourhood W of 7 such that Vol(W) < 00 and /(-R)m = 00 
for some curvature invariant f{R) which is (positively) homogeneous in 
the Riemann tensor, f{XR) — \X\ f{R) for all A > 0. 

The homogeneity condition is important because otherwise we could 
take an appropriate power of a weakly diverging curvature invariant in 
order to obtain a diverging integral: While is finite, the integral 

fo 

Hawking & Ellis state that while they are convinced of the validity 
of such a conjecture"^ they are unable to prove it. In Sect. 9.5,1 below we 
will give a 3-dimensional example which indicates that Conjecture 9.5,2 
may not be true in the present form. 

If there is a singularity in our universe, we would like to interpret 
it as a black hole, i.e., we would hope that it is invisible — just as the 
singularity in the Schwarzschild spacetime. Otherwise we would not have 
a chance to globally solve Einstein’s equation as a Cauchy problem since 
the singularity (whose data are unknown) would influence the geometry 
of spacetime to its future. There are also important theorems for our 
interpretation of black holes which need a assumption similar to cosmic 
censorship. The prime example is the “area theorem” due to Hawking 
which states that the area of black holes can only increase^) (Wald 1984, 
theorem 12 . 2 . 6 ). 

Since it is easy to find examples of inextensible Lorentzian manifolds 
which contain visible (or “naked”) singularities, additional assumptions 
on spacetime must be made in any conjecture which “censors” naked 
singularities. The following conjecture is due to Penrose. 

Conjecture 9,5,3 ((strong) cosmic censorship). If (M^g) is qualitatively 
stable and its matter model T is physically reasonable then no future 
incomplete, future inextensible causal geodesic 7 lies in the past of any 
X e M. 



^ They state their conjecture with respect to a different singularity theorem. 
^ This is only true in classical general relativity without taking quantum 
effects into account 
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It is not sufficient just to demand a “physically reasonable” matter model 
because of the Reissner-Nordstrom solution, 



^ = - 1 



2m 
h 



.V / ^ 2m 



-1 



dr^ 



H- (d^^ + sin^ 0d(/?^) 

The energy momentum tensor T — ^(Ric - (Scal/2)5f) is given by 



T = 



S'Trr'^ 

where we have set 



(c/^ -Q'’ + r‘^ (d^2 ^ gjjj2 j ^ 




du 



Q = 




2m V 
r r^y 



dr. 



This spherically symmetric spacetime satisfies Einstein’s equations for 
an electromagnetic field (cf. Lemma 7.4.1) which is certainly “physically 
reasonable”. On the other hand, unlike in the case of the Schwarzschild 
solution (e = 0) where the hypersurfaces r = const 1 are space- 
like, the hypersurfaces r = const < 1 in the Reissner-Nordstrom so- 
lution are timelike. It is easy to see that there exist points € M 
such that I~^{x) Pi I~(y) contains timelike future inextensible curves 
which approach r ~ 0. (For a more thorough discussion of the Reissner- 
Nordstrom spacetime including its global properties cf. (Hawking and El- 
lis 1973, chapter 5.5)). This Reissner-Nordstrom spacetime therefore vi- 
olates Conjecture 9.5.3 if it is qualitatively stable. Calculations by Simp- 
son and Penrose (1973) and McNamara (1978) indicate that this is not 
the case for an intuitive notion of stability. It is generally believed that 
a generic, physically acceptable perturbation of the Reissner-Nordstrom 
spacetime results in a spacetime which is qualitatively more similar to the 
Schwarzschild spacetime, even though the Reissner-Nordstrom spacetime 
itself can be thought of as a perturbation of the Schwarzschild spacetime. 



9.5.1 A simple, 3- dimensional example 

Let {M,g) be a 3-dimensional spacetime and assume that the energy 
momentum tensor is given by T == ell^ (g> where U be the spacetime 
velocity of the dust particles and e their energy density. We are seeking 
solutions of Einstein’s equation 

Ric — -^Scalp = SttcU^ 0 

where e: M R is a function. In general, this is still too difficult even 
though we assume n = 3. Assuming that there is a foliation of spacelike 
hypersurfaces orthogonal to U simplifies the problem dramatically. 
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Lemma 9.5,1. The vector field U is irrotational (Le., dU^ ~0) if and 
only if the Pfaffian system {U^} is integrable. 

Proof. By Lemma 2.5.8 the integrability of {U^} is equivalent to the 
equation dU^ AU^ — 0. Recall from Lemma 5.2.1 that U satisfies the 
geodesic equation VjyC/ — 0. Let X be any vector field. Then we have 
dU\X,U) = XxU\U) - VuU^(X) = g{VxU,U) - g(VuU,X) = 
0 — 0 since g{U^U) = —1. It follows that dU^ is completely determined 
by evaluating it on vectors orthogonal to U. Let v^w be two vectors 
orthogonal to U. Then we have dU^ A U^{v^ w, U) = ~dU^(v, w) and the 
equivalence follows. I 



Lemma 9.5.2. Let {M^g) be an irrotational^ 3- dimensional dust space- 
time and let S be a hypersurface which is orthogonal to U. If atp e S the 
second fundamental form of E is not a multiple of the metric, then p has 
a neighbourhood with coordinates (t, x, y) such that g — — dt^ + T/^dx^ + 
W’^dy’^, where V, W are functions of t, x, y and T = e{t, x, y)dt 0 dt. 

These coordinates are unique up to transformations of the type x 
X{x), y Y{y), t t + const, and interchanging of x and y. 

Proof. Since {U^} satisfies dU^ = 0 there is a function t with dt — U^. 
We can write g = x^)dx'^dx^, where for each t 

the bilinear form *, •) is a Riemannian 2-metric and E is given by 
t = to- Since is not umbilic at p, there exists a frame {61,62} of 
in a neighbourhood of p such that ^'^^g and both diagonal 

with respect to this frame. Let be the dual frame. It follows 

that there exist coordinates (x, y) such that at t = to both ^^'Ig and 
are diagonal with respect to dec, dy, Ij^ feet, we only need to show that 
there exist multiples aiei, a2C2 of ei, 62 such that [aiCi, a2C2] = 0. This 
is equivalent to dai(ei+i mod 2) 4- (-l)^o:u;^([ei, 62]) = 0 (no summation 
over i) which is a system of ordinary differential equations and can be 
solved by Theorem 2.4.1. With respect to the coordinates (t,x,y) the 
equations Tij = 0 {i,j e {x,y}) imply 






+ j((tr(9,(2V 



))'-3|a/2). 



( 2 ), 



'9ij- 



Since at t = to the bilinear forms and are diagonal, it follows 

that the right hand side is also diagonal at t = to. Since the system 
has a unique solution and there exists a solution when are 

simultaneously diagonal, must be diagonal for all t. Hence we 

have existence. For uniqueness observe that the frame {61,62} is unique 
up to multiples and permutation, and that the coordinate t is already 
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chosen so that it is unique up to an additive constant. Thus any other 
coordinates Y with the same properties must satisfy either || dx 

and dy |1 dy or dx || dy and dy || dx> This proves the lemma. I 

Corollary 9*5.1* Let be a 2-dimensional Riemannian mani- 

fold and k be a symmetric ( 2 ) tensor field which is not proportional to 
Then the initial value problem for irrotational, 3-dimensional dust 
spacetimes with initial data (17, k) reduces to a constrained system 
of ordinary differential equations. 

In Theorem 9,5.1 below we will summarise properties of generic, irrota> 
tional, 3-dimensional dust spacetimes using standard differential geomet- 
ric terminology. Consider a 2-dimensional Riemannian manifold {X^g^) 
and denote the set of unoriented lines in TX by Pi7. Then there exists 
a natural map : Pi7 — PI7 which maps an unoriented line I e FX to 
the line orthogonal to it. We call a section I of PI7 nowhere geodesic if 
for any local, non- vanishing vector field L with L{p) e l{p) Vp we have 
gz:{L‘^ i^L) ^ 0 Vp. This condition does not depend on the chosen 
representative L. It is a local but not necessarily a global genericity condi- 
tion on L Locally, this condition is slightly stronger than demanding that 
I does not have any local integral curve which is a geodesic. Let (X^gx) 
be a 2-dimensional, spacelike submanifold of a Lorentzian 3-manifold 
(M,p) with future directed normal n and second fundamental form 
k{X^ Y) = n). We denote the bilinear form associated with the 

square of the corresponding matrix by i.e., {k‘^)ij = {gEY^hikjm- 
We call the eigenvalues ^2 of k with respect to gx the principal cur- 
vatures of X and the (unoriented) lines spanned by the eigenvectors the 
principal directions of X, 

Theorem 9.5.1. Let (R^, ^^^g) be a Riemannian 2-manifold andl: R^ — »• 
fR^ which maps each point p G R^ into an unoriented line l(p) C TpR^ 
and assume that I is nowhere geodesic. Let C C F? be a smooth curve 
which divides R^ into two disconnected regions such that I and l-^ in- 
tersect TpC transversely at each p e C, Finally^ let KijK 2 ‘^ C — > R be 
smooth functions, 

(i) Let the set of points p G R^ such that the integral 

curves of I and l-^ through p intersect C, There exists an irrotational^ 
3- dimensional dust spacetime (M,p) and an isometric embedding 
i: X C M such that the second fundamental 

form k of X in M satisfies 

(a) the principal directions of X are given by i^l, i^l^^ 

(b) along C the submanifold X has principal curvatures ki — Ki^ 

C can be chosen such that 2)(C, I, l^) ~ B?, Then (M,g) is inexten- 
sible if (R^, is so. 
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(a) For any p £ E let 

k-{p) = min(fei(p),fc 2 (p)) md fc+(p) = max(/ci(p), ^(p))- 

The world line of the dust particle through p ends in a curvature 
singularity in finite proper times ^/^+(p) > ^ > k~{p)> 

at finite proper time if (p) > 0, at finite proper time 
if k^{p) < 0. There are no other singularities. 

(Hi) All singularities are weak in the se nse that for all open sets 
U with bounded volume, vol{U) — det{gab)dtdxdy < oo, the 
spacetime average of the energy density, 




is also bounded. 

(iv) The spacetime is non-singular if and only if K\ ~ K 2 = ^. 

(v) For generic initial data strong cosmic censorship is violated, pro- 
vided one regards the solution as ^^qualitatively stable’^ and ^^physi- 
cally reasonable”.^ 

(vi) Generically, the data ^^^g, I, K\, K 2 parameterise the set of local, 
irrotational, S- dimensional dust spacetimes. 

Properties (iii) and (ii) hold true in a more general context (Kriele and 
Lim 1995). 

Proof of Theorem 9.5.1. (i): We can choose coordinates (x, y) such that 
at each point p € the Gaussian vectors dx, dy span I and (see the 
proof of Lemma 9.5,2). The metric is diagonal in these coordinates. 
Let Vq ~ ^0 ” of Lemma 9.5.2 we can set 

S = {{t,x,y) : t = 0} and assume that the dust-metric is given hyg — 
-dt^+V^{t,x,y)dx‘^ + W^{t,x,y)dy‘^, V{0,x,y) = Vo{x,y), W{Q,x,y) = 
Wo{x,y). Since the constraint Tty = 0 is equivalent to 

dtdy dt dy 

we obtain either ^ = 0orW ~ u)^, where w is some non- vanishing 
function of {x,y). In the coordinates {x,y) we can write L{x,y) = 
f{x,y)dx and (L{x,y))^ = h{x,y)dy, where /, h are non-vanishing 
functions. The condition that I is nowhere geodesic reduces then to 

fhVo^ ^ 0. Thus we have ^ # 0. Since 0 = Tyy f ^ 

we can write V {t, x, y) ~ Vq{x, y) + tq{x, y). Now the equations Ttx = 0 
gives 

® The relation of these solutions and cosmic censorship are further discussed 
in (Kriele 1997). 
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Fig. 9.5.1. The singularity structure of spacetime. The case ki^k^ < 0. The 
y-component of spacetime is suppressed. The singularity A is given by 1 -h 
t {dVo/dy)~^ {dq/dy) = 0 and the singularity B is given by Vo-\-tq = 0. Observe 
that at the singularity A the light cone degenerates in the y-direction and that 
at B degenerates in the rr-direction. Hence there exist future directed timelike 
curves emanating from the singularity and cosmic censorship is violated 



0 



dw dq 



dw dVo 
dx dy ^ 



w 



d^Vo d^q 

dxdy^ ^ dxdy 



Vo. 



( 9 . 5 . 7 ) 



Re-expressing w{x^y) by Wo = we obtain the linear, hyperbolic 
partial differential equation 



dxdy 




dWp 

dx 




dxdy j dy 






q = 0 



( 9 . 5 . 8 ) 



for the function q{x^y). The lines I, l-^ are transverse to C. Moreover, it 
follows that each integral curve of I and l-^ intersects C at most once. In 
fact, if a coordinate line x = xp would intersect C twice then there would 
exist a point (xm^l/m) of C in between these intersection points which 
has locally maximal distance to (xo^ym) with respect to the flat metric 
dx'^ 4- . At this point dy would be tangent to C in contradiction to our 

transversality assumption. We will now show that for given initial values 
this differential equation has a unique solution in 2)(C, I, i-^) by reducing 
it to an appropriate system of hyperbolic differential equations. Let 



Alternatively, we could directly appeal to standard theorems. Since the 
symbol of the hyperbolic equation (9.5.8) is constant it has a unique, global 
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= X -\-y and = x — y. With respect to these variables Equation 
(9.5.8) reduces to an equation of the form 



d^q 

(5ti)2 



d'^q 



= h 






where h: K® — E is a suitable function. Setting r = and s = 
we see that Equation (9.5.8) is equivalent to the hyperbolic system of 
equations 



dq ^ dr 
dri dr I 



ds 

dr2 



+ /i(ri,T2,g, f, S), 



ds __ dr 
dri 8 t2 * 



The characteristic directions are given by the left eigenvalues of the ma- 
trix 




h = (1,0,0), I 2 = (0, 1, 1), I 3 = (0, 1, -1). These vectors are linearly in- 
dependent and we can apply Corollary 7.3.2 together with Remark 7.3.1 
in order to obtain a solution for given initial values. The initial values 
for q can be calculated from the initial values for q. If we parameterise 
C by a curve s A(s) we get 



dqo X 
ds 



||(dri(A)f + dT2(A)s) 



and another linear combination for the normal derivative of q which 
depends on the coordinate expression for Hence we can calculate 
the initial values for (g, f, s) if we know q^c its normal derivative. 



|c 



we 



Since this normal derivative can be calculated from q^c 
have as our initial data q\c = KiVo\c and (^) c 



= 0 and Txx = 0 hold automatically this leads to a solution of 
Einstein’s equation which is given by 







(9.5.9) 



It follows immediately that the principal curvatures are given by 
ki(x,y) = q{x,y)/Vo{x,y) and k 2 (x,y) = 



solution in S(C, 1, 1"^) for the initial value problem with respect to the initial 
curve C C S (Garabedian 1986, section 4.2). 



/ ^ ^ 
\dy) dy' 
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If one chooses C = {{x,y) : x - y} then Since the 

solution is inextensible in t-direction {M,g) is inextensible provided 
D{C,l,l-^) = and is inextensible. 

(ii): The energy density is given by 



where 



e{t,x,y) 



Ejx, y) 



( 9 . 5 . 10 ) 



1-1/ \ 

E{x,y) = ~q + 




dWp 

dy 






dVo d^Vo 
dy dy^ 



Wq-" 



dVp d^Wo 
dy dx^ 



Vb-^ITo-^ 



+ 



dVo 8Wq 8^0 
dy dx dx 



yo“^Wo“^ 



Since g is given by Equation (9.5,9) we have 4- W^dy^ 

and k = qVodx'^ -f Wq follows from Equation (9.5.10) 

that e may become infinite at Vb 4 ~ 0, ^ ~ ^ that 

generically it will be infinite at these points. The first part of (ii) follows 
immediately. To see that there is no other singularity observe that for 
3-dimensional spacetimes the Riemann tensor is completely determined 
by the energy momentum tensor and that therefore all Riemann tensor 
components are bounded where e is bounded. 

(iii): This follows since 



€y/~ det{gab) = eWb 



^EWo 




(Vo-i^tq) 






is finite. 

(iv) : Since the initial data for Equation (9.5.8) are given by q\c = 

KiVo and “ -^ 2 ^ the claim follows from the uniqueness theo- 

rem for PDEs of the type (9.5.8). 

(v) : Since gu = -1 and gxx or gyy converges to zero it is easy to see 
that at these singularities strong cosmic censorship is violated unless the 
singularity is given by ^ = const, (cf. Fig. 9.5.1). 

(vi) : Implicitly we have assumed that the second fundamental form 

and the metric can be diagonalised with respect to a smooth frame. This 
may not be possible at umbilic points but points at which this problem 
occurs are isolated and therefore not important for local genericity. Since 
^ ^ 0 is also a local genericity condition, locally almost every irrota- 
tional dust spacetime can be obtained in this way. Given C, our initial 
data are invariants. Hence (vi) follows. I 
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Corollary 9.5.2. Almost all 3- dimensional j irrotational dust spacetimes 
can be obtained by quadratures. 



Proof, In Theorem 9.5.1 we have solved Einstein’s equation as an initial 
value problem where Vb? j given. Now we consider Vb, 

W^0]x=jco given initial data. Then we can solve Equation (9.5.8) and 
obtain the explicit solution 

Vodq/dy - qdVo/dy 



Wq == Wo\x=xo{y)^' 



f 

it/XO 



1^0 






xo dV^/dy d^V^jdxdydqldy - d^qjdxdydVojdy 

(9.5.11) 



Remark 9,5,1, In Theorem 9,5.1 we have only considered the case where 
^ ^ 0. There are additional restrictions at points with — 0. It 

follows from Equation (9.5.8) that at these points either = 0 or all 
y-derivatives and mixed derivatives of V vanish. For completeness we 
will discuss both cases in more detail. 

If does not vanish identically then we can use Equation 9.5.11 
to solve Einstein’s equation. Since Wq is smooth and does not vanish, 
the integrand in Equation 9,5.11 must also be smooth. It follows that 
in a neighbourhood of {(o;,y) : ^ = 0 at {x,y)} the y-derivative of q 

must satisfy Qy = Jvo jdxdy (''^^ + ^ function of 

(rr,y). If this is satisfied then we obtain a local, non-singular solution. 
Otherwise we simply have specified singular initial data. 

The non-generic case ^ = 0 can be easily solved. If ^ = 0 then 

Txx = '^yy ~ Thus we can write V{t^x) = Vo(x) + 

tq(x) and W{t^x,y) — Wo(a:,y) +ts{x^y). Now Ttx = 0 implies 




dWo 

dx 



q^O, 



(9.5.12) 



Since the initial metric induced on S is given by 

9s = Vo^{x)dx‘^ + Wo^(x,y)dy'^ 

we have Vb(a;) ^ 0 Vx. Thus we can use our coordinate freedom to 
normalise Vb(a;) = 1. Equation (9.5.12) can be immediately integrated 
for any given q, Wq . We have automatically Txy — 0 and hence for any 
functions q{x)^Wo{x,y),w{y) we obtain a solution 

g = -dt^ + (1 + tq{x))'^dx‘^ + {Wo{x,y) + ts{x,y)fdy'^, 

were s{x,y) = '■^d^+w{y). The space of solutions is param- 
eterised by the functions q(x)^Wo(x^y)^'w{y) modulo coordinate trans- 
formations (x, y) 1 -^ (x 4“ xo, y (y)). The energy density is given by 




420 9. Singularity theorems 



t{t,x, y) 



d'^Wn 



+ qs 



{l+tq){Wo^tsY 



(9.5.13) 



Hence we obtain the same type of singularities as in Theorem 9.5.1. 

We can express the metric provided by Theorem 9.5.1 (or the preceding 
remark) in a more geometrical form. 



Corollary 9.5.3. Let (M,p) be a 3- dimensional spacetime. If there ex- 
ists a timelike unit length vector field U and a function e such that 
T — eU'^ 0 and dU^ a = 0^ then there exist coordinates (t, x, y) 
with = dt and 



g = -dt^ + e^^^^^y\l-^tki{x,y)fdx^ 

+ +tk2{x,y)Ydy^ 

where a, b are free functions and fci, satisfy 
dyki = {k2 — ki)dya^ 

d^dyki ~ (fe2 - ki){ds;dya ~ dyud^b) - dxkidyO. 

The energy density is given by 

~ ^1^2 d- e^ei • ei • e~^ d- e^e2 • C2 • 

(1 + tfci)(l -j- 1 /: 2 ) ^ 

where ei = e~^dx, 62 = e~^dy. 

We will now show that our examples do not satisfy the genericity con- 
dition. It turns out that otherwise they would provide counterexamples 
to Conjecture 9.5.2 (cf. corollary 9.5.5 below). 

Lemma 9.5.3. Let (M, g) be a 3-dimensional pseudo- Riemannian man- 
ifold. Then the Riemann tensor is completely determined by the Ricci 
tensor and given by 



Rijkl — 9j[k^l]i) ^^^9i[k9l]j' 

Proof. Proposition 4.3.2 implies that for every pair of tensors Gx^S^ G 
sym(T2 (Ta;M)) there is a metric g such that gx = Gx and Sx = RiCa;. 
In fact, we can choose coordinates such that {Gx)ab = VaSab 

(no summation) where r/a G {-1, 1}. Then we simply set 

1 ^ 1 

9ab{^ ) == {Gx)ab “ ^ '^i^x)ab9c^cd^^^^ • 

c,rf=l 

Prom the first, the third and the fourth symmetry in Proposition 4.3.1 
we obtain that at a given point x the Riemann tensor of a 3- dimensional 
pseudo- Riemannian manifold is already specified by the 6 components 
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^1212i -^1213) ^1223} ^^1313i -^1323? -^2323- 

Since sym(T^(Ta.M)) is a 6-dimensional vector space and every tensor 
Sx € sym{T 2 (TxM)) can be realised as the Ricci tensor of a metric, 
the map ti: Rx tr(jRa,) = Ric^ is linear isomorphism. The tensor 
r(Ric)ij-fc^ = 2{gi[kRi]j - 9j[kRi]i) “ tr(Ric)^^[j^5^/]^' satisfies the equations 
given in Proposition 4.3.1 and tr(r(Ric)) = Ric. Hence it is the Riemann 
tensor corresponding to Ric. I 



Corollary 9.5.4. If (M^g) is a 3- dimensional^ irrotational dust space- 
time then the genericity condition does not hold. 

Proof The Ricci tensor is given by Ric = e(ei‘’®ei^-he2^®e2‘'). It follows 
directly from Lemma 9.5.3 that the components Rtjkt (j,k € {t^Xyp}) 
of the Riemann tensor vanish. Hence for any fixed numbers {xo,yo) the 
genericity condition is violated along the timelike geodesic t (t , Xq , ^o) • 



Corollary 9.5.5. There is a 3-dimensional spacetime (M^g) which 

(i) is chronological, 

(a) is geodesically inextensible, 

(in) satisfies the timelike convergence condition 

(iv) contains a closed trapped surface, 

(v) and contains an incomplete future inextensible geodesic 7 and 
a neighbourhood U of 'y such that Vol(W) < 00 and f/j> < 00 for 
any polynomial curvature invariant f which is linear in the Riemann 
tensor. 

Proof Consider a spacetime as given by Corollary 9.5.3. In order 

to obtain a closed trapped surface we let M — R x x R = {{t,x 
mod 1,1/)} and h{x,y) ~ l,a{x,y) = a{y). Then for each function fci(y), 
the metric 

g - -dt^ + e^''(l -h tki)‘^dx^ + (1 + ^(^1 + {kiY /a^ffdy^ 
is a solution with 

_ ki{{ki -h dyki/dyo) 4- 6^62 • 62 • e^^ 

(1 -h tki)(l + t{ki -h (kiY /a')) 

Let T := {t,x,y\t = 0,y — 0} and S = Snu^ ® be the second 
fundamental form of T C {t = 0}. Then up to a positive factor, the 
expansions 9 ^ are given by 6 ^^ - 7 : 5n. It follows that T is a closed 

trapped surface if fci(O) < -|5n|. This can always be arranged since 
ki can be freely specified. The spacetime (M,g) has a singularity at 
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y = 0, t = — l/fci(0). However, this singularity is so weak that it satisfies 
Vol(W) < 00 fy \e\iJ,M < 00 . I 

Corollary 9.5.5 implies that Conjecture 9.5.2 without the genericity con- 
ditions does not hold. In order to estimate the physical relevance of our 
example we have to examine its special features. 

(i) The genericity condition is not satisfied along the geodesic t 

Its usage in the proof of Theorem 9.3.1 is to ensure the 
existence of a singularity along it. Since there actually develops a 
singularity at 1 + tki{0) = 0, the failure of (M,y) to satisfy the 
genericity condition does not appear to be grave. Moreover, it seems 
very likely that a perturbation of (M, g) through dust spacetimes 
which are not strictly irrational will not suffer from this defect. On 
the other hand, we don’t know much about the global properties of 
these perturbed spacetimes. In particular, at this point of time^^ we 
cannot exclude the possibility that they form stronger singularities 
then the special spacetimes we have examined. 

(ii) Our example is 3-dimensional rather than 4-dimensional. Here it is 
important to note that Theorem 9.3.1 does hold for 3-dimensional 
spacetimes as well as for 4-dimensional ones. Moreover, there do 
exist 4-dimensional, spherically symmetric dust spacetimes which 
have similar singularities (Muller zum Hagen, Yodzis, and Seifert 
1974). However, these 4-dimensional examples also contain much 
stronger singularities in the centre of symmetry. One may speculate 
whether these strong (central) singularities are a typical feature for 
4-dimensional spacetimes. In the absence of independent evidence 
disqualifying 3-dimensional models it seems fair to state that our 
example indicates otherwise. 

(iii) The closed trapped surface we have constructed has not much to 
do with the existence of singularities. In fact, the general solution 
shows that the singularities depend solely on the principal pressures 
fcij This indicates that the example is more appropriate to il- 
lustrate condition (iii)(b) of Theorem 9.3.1 rather than condition 
(iii) (a). However, the hyperbolic nature of Equation (9.5.8) greatly 
restricts the existence of global solutions with compact hypersur- 
face U. Still, as an immediate consequence of Corollary 9.5.3 we 
have the existence of a 3-dimensional dust spacetime for any given 
2-dimensional Riemannian manifold {S^gs) by choosing k — \gs, 
where A is a constant. The energy density e is positive if |A| is suf- 
ficiently large. 

Our construction of closed trapped surface requires (at least) a pe- 
riodicity with respect to x. Observe that perturbation of our initial 
data are very restricted since the differential Equation (9.5.8) does 



11 



I am writing this in 1998 
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not need to respect this artificial periodicity. It is possible, however, 
to construct closed trapped surfaces with are stable with respect to 
arbitrary perturbations of the initial data. Choose a 2-dimensionai 
Riemannian manifold {S^gs) and a closed curve T C UAf X «1 
is small enough and k ~ Xgs then T is a closed trapped surface. 
This construction does not rely on periodicity and is therefore stable 
with respect to perturbations of initial conditions. 

The trapped surface has topology which is qualitatively different 
from 6'^ and more akin to the torus x 5^ This is inevitable 
if one works with 3-dimensional spacetimes. Unlike in the case of 
Conjecture 9.4.1 there there does not seem to exist evidence that 
the topology of the closed trapped surface matters in our context, 
(iv) One can argue that dust, arising as an idealisation from the energy 
momentum tensor for collisionless gas, is not a very realistic matter 
model. Moreover, even in the corresponding Newtonian theory, con- 
gruences of dust tend to form weak singularities. (Rein, Rendall, and 
Schaeffer 1995) has shown for spherically symmetric 4-dimensional 
spacetimes representing a collisionless gas that one does not obtain 
weak singularities before a central singularity has formed. This is 
in striking contrast to the analogous situation in the case of dust 
(Muller zum Hagen, Yodzis, and Seifert 1974). One may therefore 
be tempted to disregard our example as typical for a notoriously ill 
behaved matter model. However, since in our class of examples we 
only obtain weak singularities, which are forced on us by the singu- 
larity Theorem 9.3.1^^, we are still led to conclude that singularities 
(mathematically) due to the singularity theorems may be very weak 
and completely different from what one may expect at first sight. 

Hence whether or not our example is physically realistic, it indicates that 
existing singularity theorems are not sufficient to conclude the existence 
of black holes or the big bang. 

One additional, physically motivated assumption could be that the 
principal pressures diverge comparably to the energy density. It would 
therefore be of interest to study a similar example with T — (1 -j- P)eco (§) 
uj -(- peg (/? € M). For these 3-dimensional spacetimes there exist coordi- 
nates (t,x,y) as above: 

g ^ -dt^ + + W^dy^, 

T = edt‘^-hpe{dx^-hdy‘^), 

We have some control over the location of the singularities (they must 
occur before proper time 2/{ki 4- ^ 2 )? where fci, ^2 denote the principal 
curvatures of the initial hypersurface) . This class of solution may still 

Here we assume that in our case the violation of the genericity condition is 
irrelevant 
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be manageable and can give us an important clue as to whether the 
singularity theorems really give evidence for the existence of physical 
singularities. 
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